Back to open data
NLP & Language

IADD — Integrated Arabic Dialect Identification Dataset

About

Integrated Arabic Dialect Identification Dataset (IADD): 135,804 texts from Twitter, Facebook, manual transcriptions, and news comments. Covers Maghrebi (incl. Morocco), Levantine, Egyptian, Gulf dialects. Published 2022 in Data in Brief.

https://github.com/JihadZa/IADD
Visit Website