
The Vuk'uzenzele South African Multilingual Corpus

The dataset contains editions of the South African government magazine Vuk'uzenzele. The data was scraped from PDFs, which are stored in the data/raw folder. The PDFs were obtained from the Vuk'uzenzele website (https://www.vukuzenzele.gov.za/).

The dataset covers magazine editions in 11 languages.

