About
Browse records
Blog
Contribute
Home
About
Browse records
Blog
Contribute
Lanfrica’s new look is almost here! Get ready for a whole new way to discover.
Learn more
.
X
×
Loading…
Oscar
The Open Super-large Crawled ALMAnaCH coRpus is a huge multilingual corpus obtained by language classification and filtering of the Common Crawl corpus using the goclassy architecture.
Link
Link
CONNECTED RECORDS
paper
paper
LANGUAGES
amharic
swahili
yoruba
somali
arabic, egyptian spoken
afrikaans
TASKS
language modeling
Cookies are used on the Lanfrica website to ensure you get the best experience.
Got it