NLP & Language

OMCD — Offensive Moroccan Comments Dataset

About

OMCD contains 8,024 YouTube comments written in Moroccan Darija, each manually labeled for offensive language detection. The dataset was published in 2023 in Language Resources and Evaluation (Springer). It is useful for building and evaluating content moderation systems for Darija.

https://github.com/kabilessefar/OMCD-Offensive-Moroccan-Comments-Dataset