Lanfrica

ANTC — African News Topic Classification Dataset

We created a novel dataset, ANTC (African News Topic Classification), for four African languages. We obtained data from three different news sources: VOA, BBC, and isolezwe. From the VOA data we created datasets for Lingala and Somali. We obtained the topics from data released by Palen-Michel et al. (2022) and used the provided URLs to get the news category from the websites. For Pidgin and isiZulu, we scraped news topics directly from the respective news websites (BBC Pidgin and isolezwe, respectively) based on their categories. We noticed that some news topics are not mutually exclusive to their categories; therefore, we filtered out topics with multiple labels. We also ensured that each category has at least 200 samples. The categories include, but are not limited to, Africa, Entertainment, Health, and Politics. The pre-processed datasets were divided into training, development, and test sets using stratified sampling with a ratio of 70:10:20. Appendix A.2 has more details about the dataset size and news topic information.
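The 70:10:20 stratified split described above can be illustrated with a minimal sketch. This is not the ANTC authors' code: the function name `stratified_split`, the seed, and the toy label counts are assumptions made only for illustration, and the rounding rule for per-category sizes is one of several reasonable choices.

```python
import random
from collections import defaultdict

def stratified_split(labels, ratios=(0.7, 0.1, 0.2), seed=0):
    """Return (train, dev, test) index lists, splitting each category separately."""
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for idx, label in enumerate(labels):
        by_label[label].append(idx)

    train, dev, test = [], [], []
    for label in sorted(by_label):
        idxs = by_label[label]
        rng.shuffle(idxs)  # shuffle within the category only
        n_train = round(len(idxs) * ratios[0])
        n_dev = round(len(idxs) * ratios[1])
        train.extend(idxs[:n_train])
        dev.extend(idxs[n_train:n_train + n_dev])
        test.extend(idxs[n_train + n_dev:])  # remainder goes to the test set
    return train, dev, test

# Toy labels (hypothetical counts); every category has at least 200 samples.
labels = ["Africa"] * 200 + ["Health"] * 200 + ["Politics"] * 300
train, dev, test = stratified_split(labels)
```

Because each category is split on its own, the category proportions in the training, development, and test sets match those of the full dataset up to rounding.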

topic classification

African Voices

African Voices is a collaborative project that aims to collect high-quality speech (TTS) datasets and synthesizers for all African languages. You can search datasets and synthesizers by language. You can also synthesize text from your synthesizer of choice. Additi...

AfroLID: A Neural Language Identification Tool for African Languages

AfroLID is a powerful neural toolkit for African languages identification which covers 517 African languages.

AfroLID: A Neural Language Identification Tool for African Languages

Language identification (LID) is a crucial precursor for NLP, especially for mining web data. Problematically, most of the world's 7000+ languages today are not covered by LID technologies. We address this pressing issue for Africa by introducing AfroLID, a neural LID toolkit for 517 African languages and varieties. AfroLID exploits a multi-domain web dataset manually curated from across 14 language families utilizing five orthographic systems. When evaluated on our blind Test set, AfroLID achieves 95.89 F_1-score. We also compare AfroLID to five existing LID tools that each cover a small number of African languages, finding it to outperform them on most languages. We further show the utility of AfroLID in the wild by testing it on the acutely under-served Twitter domain. Finally, we offer a number of controlled case studies and perform a linguistically-motivated error analysis that allow us to both showcase AfroLID's powerful capabilities and limitations.

AfroMT

Code for the EMNLP 2021 Paper AfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages.

machine translation

AfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages

Reproducible benchmarks are crucial in driving progress of machine translation research. However, existing machine translation benchmarks have been mostly limited to high-resource or well-represented languages. Despite an increasing interest in low-resource machine translation, there are no standardized reproducible benchmarks for many African languages, many of which are used by millions of speakers but have less digitized textual data. To tackle these challenges, we propose AfroMT, a standardized, clean, and reproducible machine translation benchmark for eight widely spoken African languages. We also develop a suite of analysis tools for system diagnosis taking into account the unique properties of these languages. Furthermore, we explore the newly considered case of low-resource focused pretraining and develop two novel data augmentation-based strategies, leveraging word-level alignment information and pseudo-monolingual data for pretraining multilingual sequence-to-sequence models. We demonstrate significant improvements when pretraining on 11 languages, with gains of up to 2 BLEU points over strong baselines. We also show gains of up to 12 BLEU points over cross-lingual transfer baselines in data-constrained scenarios. All code and pretrained models will be released as further steps towards larger reproducible benchmarks for African languages.

machine translation

Bible TTS

BibleTTS is a large high-quality open Text-to-Speech dataset with up to 80 hours of single speaker, studio quality 48kHz recordings for each language. We release aligned speech and text for six languages spoken in Sub-Saharan Africa, with unaligned data for four additional languages, derived from the Biblica open.bible project. The BibleTTS corpus consists of high-quality audio released as 48kHz, 24-bit, mono-channel FLAC files. Recordings for each language consist of a single speaker recorded under professional quality, close-microphone conditions (i.e., without background noise or echo). BibleTTS is rare among public speech corpora for the volume of data available per speaker and its suitability for creating TTS models. Furthermore, the corpus consists of ten languages which are under-represented in today’s voice technology landscape, both in academia and in industry.

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus

BibleTTS is a large, high-quality, open speech dataset for ten languages spoken in Sub-Saharan Africa. The corpus contains up to 86 hours of aligned, studio quality 48kHz single speaker recordings per language, enabling the development of high-quality text-to-speech models. The ten languages represented are: Akuapem Twi, Asante Twi, Chichewa, Ewe, Hausa, Kikuyu, Lingala, Luganda, Luo, and Yoruba. This corpus is a derivative work of Bible recordings made and released by the Open.Bible project from Biblica. We have aligned, cleaned, and filtered the original recordings, and additionally hand-checked a subset of the alignments for each language. We present results for text-to-speech models with Coqui TTS. The data is released under a commercial-friendly CC-BY-SA license.

BloomLM

This version of the Bloom Library data is developed specifically for the language modeling task. It includes data from nearly 400 languages across 35 language families, with many of the languages represented being extremely low resourced languages. Note: If you speak one of these languages and can help provide feedback or corrections, please let us know! https://huggingface.co/sil-ai

Building African Voices

Modern speech synthesis techniques can produce natural-sounding speech given sufficient high-quality data and compute resources. However, such data is not readily available for many languages. This paper focuses on speech synthesis for low-resourced African languages, from corpus creation to sharing and deploying the Text-to-Speech (TTS) systems. We first create a set of general-purpose instructions on building speech synthesis systems with minimum technological resources and subject-matter expertise. Next, we create new datasets and curate datasets from "found" data (existing recordings) through a participatory approach while considering accessibility, quality, and breadth. We demonstrate that we can develop synthesizers that generate intelligible speech with 25 minutes of created speech, even when recorded in suboptimal environments. Finally, we release the speech data, code, and trained voices for 12 African languages to support researchers and developers.

Cc100

This corpus is an attempt to recreate the dataset used for training XLM-R. It comprises monolingual data for 100+ languages and also includes data for romanized languages (indicated by *_rom). It was constructed using the URLs and paragraph indices provided by the CC-Net repository by processing the January-December 2018 Commoncrawl snapshots. Each file comprises documents separated by double newlines, with paragraphs within the same document separated by a single newline. The data is generated using the open source CC-Net repository. No claims of intellectual property are made on the work of preparation of the corpus.
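The file layout described above (documents separated by double newlines, paragraphs by single newlines) can be read with a short sketch like the following. The helper name `iter_documents` and the sample text are hypothetical; a real corpus file would be streamed from disk instead of held in one string.

```python
def iter_documents(text):
    """Yield each document as a list of its paragraphs."""
    for block in text.split("\n\n"):  # a blank line separates documents
        paragraphs = [p for p in block.split("\n") if p.strip()]
        if paragraphs:  # skip empty blocks, e.g. trailing newlines
            yield paragraphs

# Hypothetical sample in the described layout: two documents.
sample = "Para one of doc A.\nPara two of doc A.\n\nOnly para of doc B.\n"
docs = list(iter_documents(sample))
```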

language modeling

Ccaligned Multilingual

CCAligned consists of parallel or comparable web-document pairs in 137 languages aligned with English. These web-document pairs were constructed by performing language identification on raw web documents and ensuring that the corresponding language codes appeared in the URLs of the web documents. This pattern-matching approach yielded more than 100 million aligned documents paired with English. Because each English document was often aligned to multiple documents in different target languages, we can join on English documents to obtain aligned documents that directly pair two non-English documents (e.g., Arabic-French).
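The join on English documents can be sketched as follows. This is an illustrative assumption about how such a pivot might be done, not the CCAligned pipeline itself; the function name `pivot_pairs`, the tuple layout, and the example URLs are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations

def pivot_pairs(aligned):
    """aligned: iterable of (english_url, lang, target_url) triples.

    Returns pairs of non-English documents that share an English document.
    """
    by_english = defaultdict(list)
    for en_url, lang, url in aligned:
        by_english[en_url].append((lang, url))

    pairs = []
    for targets in by_english.values():
        # Pair every two documents aligned to the same English page.
        for (l1, u1), (l2, u2) in combinations(sorted(targets), 2):
            if l1 != l2:  # keep only cross-language pairs
                pairs.append(((l1, u1), (l2, u2)))
    return pairs

# Hypothetical alignments: page "a" exists in Arabic and French, "b" only in Swahili.
aligned = [
    ("example.com/en/a", "ar", "example.com/ar/a"),
    ("example.com/en/a", "fr", "example.com/fr/a"),
    ("example.com/en/b", "sw", "example.com/sw/b"),
]
pairs = pivot_pairs(aligned)
```

Here the Arabic and French pages are paired through their shared English page, while the Swahili page has no partner and yields no pair.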

CCAligned: A Massive Collection of Cross-Lingual Web-Document Pairs

Cross-lingual document alignment aims to identify pairs of documents in two distinct languages that are of comparable content or translations of each other. In this paper, we exploit the signals embedded in URLs to label web documents at scale with an average precision of 94.5% across different language pairs. We mine sixty-eight snapshots of the Common Crawl corpus and identify web document pairs that are translations of each other. We release a new web dataset consisting of over 392 million URL pairs from Common Crawl covering documents in 8144 language pairs of which 137 pairs include English. In addition to curating this massive dataset, we introduce baseline methods that leverage cross-lingual representations to identify aligned documents based on their textual content. Finally, we demonstrate the value of this parallel documents dataset through a downstream task of mining parallel sentences and measuring the quality of machine translations from models trained on this mined data. Our objective in releasing this dataset is to foster new research in cross-lingual NLP across a variety of low, medium, and high-resource languages.

Cross-lingual Name Tagging and Linking for 282 Languages

The ambitious goal of this work is to develop a cross-lingual name tagging and linking framework for 282 languages that exist in Wikipedia. Given a document in any of these languages, our framework is able to identify name mentions, assign a coarse-grained or fine-grained type to each mention, and link it to an English Knowledge Base (KB) if it is linkable. We achieve this goal by performing a series of new KB mining methods: generating “silver-standard” annotations by transferring annotations from English to other languages through cross-lingual links and KB properties, refining annotations through self-training and topic selection, deriving language-specific morphology features from anchor links, and mining word translation pairs from cross-lingual links. Both name tagging and linking results for 282 languages are promising on Wikipedia data and non-Wikipedia data.

named entity recognition

FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech

We introduce FLEURS, the Few-shot Learning Evaluation of Universal Representations of Speech benchmark. FLEURS is an n-way parallel speech dataset in 102 languages built on top of the machine translation FLoRes-101 benchmark, with approximately 12 hours of speech supervision per language. FLEURS can be used for a variety of speech tasks, including Automatic Speech Recognition (ASR), Speech Language Identification (Speech LangID), Translation and Retrieval. In this paper, we provide baselines for the tasks based on multilingual pre-trained models like mSLAM. The goal of FLEURS is to enable speech technology in more languages and catalyze research in low-resource speech understanding.
