A suite of high-quality, open-source models developed for African languages, data, and use cases.
Currently the highest-quality synthetic voice for Kinyarwanda, enabling natural-sounding interactions for voice-based applications.
The first Kinyarwanda embedding and information retrieval model. Essential for building RAG-based chatbots like Tunga.
A foundational BERT-style model serving as the backbone for various Kinyarwanda NLP tasks, from classification to entity recognition.
An 8B parameter model by Jacaranda specifically enhanced to excel in processing and generating text in Swahili.
An 8B parameter model by Jacaranda specifically enhanced to excel in processing and generating text in the Yoruba language.
An 8B parameter model by Jacaranda specifically enhanced to excel in processing and generating text in the Hausa language.
An 8B parameter model by Jacaranda specifically enhanced to excel in processing and generating text in isiXhosa and isiZulu.
An 8B parameter multi-language model by Jacaranda enhanced for Swahili, Xhosa, Zulu, Yoruba, and Hausa.
A 7B parameter Swahili language model by Jacaranda continually pre-trained on Swahili instructions.
A Swahili foundational model continually pre-trained to extend the capabilities of Llama 2.
Browse open-source models from the Masakhane community spanning multiple African languages and NLP tasks.
Curated open datasets used to train and evaluate foundational models across Kinyarwanda and other African languages.
A comprehensive dataset designed for training high-fidelity Text-to-Speech (TTS) and Speech-to-Text (STT) models in Kinyarwanda.
The specialized dataset used to train the KinyaColBERT model, optimized for semantic search and RAG applications.
A multilingual reasoning dataset containing 50,000 synthetic reasoning examples across 10 African languages.
Explore multilingual datasets from Masakhane covering translation, speech, and NLP resources for African languages.