AfroLID is a neural toolkit for African language identification that covers 517 African languages....


Language identification (LID) is a crucial precursor for NLP, especially for mining web data. Problematically, most of the world's 7000+ languages today are not covered by LID technologies. We address this pressing issue for Africa by introducing AfroLID, a neural ...


While Proto Kru and many languages on both sides of the East-West divide today show a set of 9 oral vowels, a subset of Eastern Kru languages attests a much higher inventory, with up to five distinctive central vowels, resulting in a thirteen vowel +ATR set. Th...


A dataset covering over 700 different languages, providing audio, aligned text, and word pronunciations. On average, each language provides around 20 hours of sentence-length transcriptions. The data is mined from New Testament readings available at http://www.bible.is/

This paper describes the CMU Wilderness Multilingual Speech Dataset, a dataset covering over 700 different languages that provides audio, aligned text, and word pronunciations. On average, each language provides around 20 hours of sentence-length transcriptions. We describe ...


This paper analyzes Mande data that suggest a grammaticalization path leading from the imperative of ‘see/look’ verbs to ostensive predicators (i.e. words functionally similar to French voici, Italian ecco, or Russian vot), and further to copulas. Clear cases of co...


Founded in 1988, the Folio Group has grown from a tiny start-up into the major-league language service provider that it is today. This is largely driven by our reputation for reliability, technical expertise, fast turnaround and meticulous accuracy. Folio is recogn...


Gboard is a virtual keyboard app developed by Google for Android and iOS devices.

Keyword spotting is the task of learning to detect spoken keywords. It is the interface to all modern voice-based virtual assistants on the market: Amazon’s Alexa, Apple’s Siri, and the Google Home device. In contrast to speech recognition models, keyword spotting doe...


The Mandla dictionary is an online, open-source, crowdsourced dictionary for African languages. It features definitions, etymology, and example sentences written in the native language using both indigenous scripts and Latin script, as well as parallel definitions w...


Mandla is a language-learning app for African languages.

N'Ko (N'Ko: ߒߞߏ) is a script devised by Solomana Kante in 1949 as a modern writing system for the Mandé languages of West Africa. The term N'Ko, which means ‘I say’ in all Mandé languages, is also used for the Mandé literary standard written in N'Ko script. The scri...


PanLex, a project of The Long Now Foundation, aims to enable the translation of lexemes among all human languages in the world. By focusing on lexemic translations, rather than grammatical or corpus data, it achieves broader lexical and language coverage than relat...


PanLex is a nonprofit whose mission is to overcome language barriers to human rights, information, and opportunities. We believe that nobody should have their rights restricted because of the language they speak.