Lanfrica’s new look is almost here! Get ready for a whole new way to discover. .
X

The Aya Dataset

The Aya Dataset is a multilingual instruction fine-tuning dataset curated by an open-science community via Aya Annotation Platform from Cohere For AI. The dataset contains a total of 204k human-annotated prompt-completion pairs along with the demographic data of the annotators.

This dataset can be used to train, finetune, and evaluate multilingual LLMs.


Link

CONNECTED RECORDS

LANGUAGES

TASKS

TAGS