Join our mailing list to get updates on our events, news, and the latest from the world of African language resources.

Your email is safe with us. We promise not to spam!
Please, consider giving your feedback on using Lanfrica so that we can know how best to serve you. To get started, .
X
Filter

Filter Records

Languages

Loading...

Tasks

Loading...

Record Types

Loading...

Tags

Loading...

The African Storybook (ASb) is a multilingual literacy initiative that works with educators and children to publish openly licensed picture storybooks for early reading in the languages of Africa. An initiative of Saide, the ASb has an interactive website that enab...

Expand Abstract

The story of Linguarena I am Samba Kamara, founder of Linguarena. In 2009, before a trip to Dakar (Senegal), I decided to learn wolof language. It was not my first trip to Senegal but this time I wanted to be able to talk with locals in wolof. I was used to l...

Expand Abstract

Synchronic studies on Swahili adnominal demonstratives have not addressed the interplay between syntactic position and pragmatic function of these structures. This study shows how referential givenness of discourse entities may explain Swahili word order variation ...

Expand Abstract

Recent advances in the pre-training of language models leverage large-scale datasets to create multilingual models. However, low-resource languages are mostly left out in these datasets. This is primarily because many widely spoken languages are not well represente...

Expand Abstract

Msamiati wa Teknolojia Dijitali - Vocabulary of Digital Technology

We use the multilingual OSCAR corpus, extracted from Common Crawl via language classification, filtering and cleaning, to train monolingual contextualized word embeddings (ELMo) for five mid-resource languages. We then compare the performance of OSCAR-based and Wik...

Expand Abstract

MAD-X adapters trained on AfroXLMR-base, it has the same configuration as XLMR-base....

Expand Abstract

Multilingual pre-trained language models (PLMs) have demonstrated impressive performance on several downstream tasks for both high-resourced and low-resourced languages. However, there is still a large performance drop for languages unseen during pre-training, espe...

Expand Abstract

This repository contains the code for the paper Small Data? No Problem! Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages which appears in the first workshop on Multilingual Representation Learning at EMNLP 2021. AfriBE...

Expand Abstract

African Alphabets of the Bayreuth Cluster (AABC) provides mobiles and desktop PCs (Windows and iOS) with keyboards for all common African languages and scripts. More information and the download link for the Mac/Windows program can be found at the bottom of this pa...

Expand Abstract

African Voices is a collaborative project that aims to collects high-quality speech (tts) datasets and synthesizers for all African languages. You can search datasets and synthesizers by language. You can also synthesize text from your synthesizer of choice. Additi...

Expand Abstract

AfriCLIRMatrix is a test collection for cross-lingual information retrieval research in 15 diverse African languages. This resource comprises English queries with query–document relevance judgments in 15 African languages automatically mined from Wikipedia...

Expand Abstract

Language diversity in NLP is critical in enabling the development of tools for a wide range of users.However, there are limited resources for building such tools for many languages, particularly those spoken in Africa.For search, most existing datasets feature few ...

Expand Abstract

AfriSenti is the largest sentiment analysis dataset for under-represented African languages, covering 110,000+ annotated tweets in 14 African languages (Amharic, Algerian Arabic, Hausa, Igbo, Kinyarwanda, Moroccan Arabic, Mozambican Portuguese, Nigerian Pidgin, Oro...

Expand Abstract

Africa is home to over 2000 languages from over six language families and has the highest linguistic diversity among all continents. This includes 75 languages with at least one million speakers each. Yet, there is little NLP research conducted on African languages...

Expand Abstract