
Isaka James
486 posts




Introducing the Swahili Thinking Dataset. Excited to release the first open-source chain-of-thought reasoning dataset for Swahili. Following OpenAI's Harmony response format, the dataset comprises of high-quality Swahili conversational AI responses along with their chain-of-thought. While such datasets exist for English, French, Spanish, e.t.c, there were no publicly accessible high-quality reasoning datasets for African languages. Until now!! This dataset enables researchers and developers to build Swahili language models with native reasoning capabilities, advancing AI for 200+ million Swahili speakers. Release announcement: nadhari.ai/swahili-thinki… Dataset: huggingface.co/datasets/Nadha… The dataset built upon the excellent work by @huggingface H4's Multilingual-Thinking dataset. We intend to extend the dataset in the future and we welcome further contributions to the dataset.

1 yr ago, I built cpage.co.tz to solve "university assignment" coverpage creation & "nearby stationery shops" printing issues. 📄 Posted on TikTok & it blew up. 📈 Been a wild, blessed ride so far and I am grateful! 🙏 Devs, pls check it out & share! 🖥️🤜💥🤛


Google’s Nano Banana Pro is by far the best image generation AI out there. I gave it a picture of a question and it solved it correctly in my actual handwriting. Students are going to love this. 😂













