NLP & Language
MA_Open_Datasets — Goud.ma
About
Part of the MA_Open_Datasets collection. News articles from Goud.ma in CSV format for NLP research, provided as train, test, and validation splits.
https://github.com/OumaimaHourrane/MA_Open_Datasets/tree/main/Goud.ma
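Since the data ships as CSV files split into train, test, and validation sets, a local copy could be loaded roughly as follows. This is a minimal sketch, not the repository's own tooling: the file names in `SPLIT_FILES`, the `load_splits` helper, and the UTF-8 encoding are assumptions for illustration, so check the actual file names and column headers in the repository before using it.

```python
import csv
from pathlib import Path

# Hypothetical file names; the actual names in the repository may differ.
SPLIT_FILES = {
    "train": "train.csv",
    "validation": "validation.csv",
    "test": "test.csv",
}


def load_splits(data_dir, split_files=SPLIT_FILES):
    """Read each split CSV into a list of row dicts keyed by the header row."""
    splits = {}
    for name, filename in split_files.items():
        path = Path(data_dir) / filename
        # UTF-8 is assumed; newline="" is what the csv module expects
        # so that quoted fields containing line breaks are parsed correctly.
        with path.open(newline="", encoding="utf-8") as handle:
            splits[name] = list(csv.DictReader(handle))
    return splits
```

Calling `load_splits("path/to/Goud.ma")` on a downloaded copy would return a dict with one list of rows per split. Each row is keyed by whatever column names the CSV header declares, so no column names are assumed here.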
In the same category
Goud-sum (HuggingFace) — Darija Summarization Dataset
158k articles + headlines from Goud.ma — Darija/MSA text summarization dataset
Darija Open Dataset (DODa)
100k+ Darija↔English entries, the largest open-source Darija translation dataset
MA_Open_Datasets — LeMatin
Le Matin newspaper articles by category — nation, economy, culture, sports
MA_Open_Datasets — MoroccoWorldNews
Morocco news articles dataset from MoroccoWorldNews