SANAD - sporedata/researchdesigneR GitHub Wiki
General description
The SANAD (Single-Label Arabic News Articles Dataset) is an extensive collection of over 90,000 Arabic news articles categorized in multiple fields, including Culture, Finance, Medical, Politics, Religion, Sports, and Tech, and that can be used in different Arabic NLP tasks such as Text Classification and Word Embedding.