Cookies are used on the Lanfrica website to ensure you get the best experience.
TUNIZI is referred as Tunisian Arabizi which is the representation of the tunisian dialect written in Latin characters and numbers rather than Arabic letters. This type of writing is most used on social media platformssince it presents an easier way to communicate. Also, since the Tunisian dialect already contains French and English words, people tend to use Tu-nizi for easier typing of non formal texts. Wa present TUNIZI dataset: the first 100% Tunisian Arabizi Sentiment Analysis dataset. TUNIZI is the largest dataset dedicated for the TUNIZI typing, annotated as positive, negative, and neutral and preprocessed for Sentiment Analysis subtask.