SANAD - sporedata/researchdesigneR GitHub Wiki

General description

The SANAD (Single-Label Arabic News Articles Dataset) is an extensive collection of over 90,000 Arabic news articles categorized in multiple fields, including Culture, Finance, Medical, Politics, Religion, Sports, and Tech, and that can be used in different Arabic NLP tasks such as Text Classification and Word Embedding.

Data access

Arabic News Articles Dataset