• Models

  • Datasets

Open-Source AI Models

A suite of high-quality, open-source models developed for the Tunga Agricultural Chatbot in collaboration with Rwanda C4IR, KiNLP, and MINAGRI.

Text-to-Speech (TTS)

Currently the highest-quality synthetic voice for Kinyarwanda, enabling natural-sounding interactions for voice-based applications.

View Model

KinyaCOLBERT Free

The first Kinyarwanda embedding and information retrieval model. Essential for building RAG-based chatbots like Tunga.

View Model

KinyaBERT

A foundational BERT-style model serving as the backbone for various Kinyarwanda NLP tasks, from classification to entity recognition.

View Model

Open-Source Datasets

The curated datasets used to train our foundational models, released to support the growth of the Kinyarwanda AI ecosystem.

Voice Dataset

A comprehensive dataset designed for training high-fidelity Text-to-Speech (TTS) and Speech-to-Text (STT) models in Kinyarwanda.

Access Data

Information Retrieval Dataset

The specialized dataset used to train the KinyaColBERT model, optimized for semantic search and RAG applications.

Access Data