The State of African Language Datasets

Motivation: Our digital world is a rich tapestry of ideas, languages, cultures, and knowledge. However, our access to and understanding of these resources is skewed: some gain significant visibility, while others remain under-represented and obscure (even when available on the web). Our understanding is largely defined by what’s findable. In today’s fast-paced digital age, online […]
Your Voice Matters: Join the TWB Voice Project and Make A Difference

At Lanfrica, we believe that language is inherent to human communication. While many language technology efforts are based on text, building technologies that truly support Africans and connect the tapestry of African languages requires reaching people through their major medium of communication: audio. Therefore, enabling spoken language technology […]
Mapping Nigeria’s Digital Language Landscape: Community Sprint

First things to do. Background: Web Languages Project. Welcome! This is a crowd-sourced effort to improve crawling of low-resource languages. This dataset is public. Common Crawl recognizes a lot of languages, but we can see that we don’t have enough data for many of them. We are interested in languages from all over the world. If you choose […]
Community Engagement Curation at Lanfrica

We are excited to welcome Elizabeth Chikapa to Lanfrica as our Community Engagement Curator (CEC)! Her passion for community building, enthusiasm, skills, and dedication truly stand out and will play a pivotal role in expanding our reach and making a positive impact on the “underground communities” we aim to serve. Motivation Lanfrica is on a […]
We Ain’t Just Cooking Jollof Rice, We Are Building AfricaNLP

There is more to Africa than the varieties of Jollof Rice delicacies or the melodious and vibeable tunes of Afrobeats. There are African languages! With the rise of Natural Language Processing, attention has been heavily placed on Western languages, with several discoveries and advancements that make these languages highly resourced. However, one may ask or […]