
The Aya Dataset

The Aya Dataset is a multilingual instruction fine-tuning dataset curated by an open-science community through the Aya Annotation Platform from Cohere For AI. It contains a total of 204k human-annotated prompt-completion pairs, along with demographic data about the annotators.

This dataset can be used to train, fine-tune, and evaluate multilingual LLMs.

