
MassiveSumm

This repository contains links to the data, and code to fetch and reproduce it, as described in our EMNLP 2021 paper "MassiveSumm: a very large-scale, very multilingual, news summarisation dataset". MassiveSumm is a (massive) multilingual dataset covering 92 diverse languages across 35 writing scripts. With this work we attempt to take the first steps towards providing a diverse data foundation for summarisation in many languages.

