دورية أكاديمية

ANAD: Arabic news article dataset

التفاصيل البيبلوغرافية
العنوان: ANAD: Arabic news article dataset
المؤلفون: Mohammed Altamimi, Abdulaziz M. Alayba
المصدر: Data in Brief, Vol 50, Iss , Pp 109460- (2023)
بيانات النشر: Elsevier, 2023.
سنة النشر: 2023
المجموعة: LCC:Computer applications to medicine. Medical informatics
LCC:Science (General)
مصطلحات موضوعية: Arabic news articles, Data analysis, Classification, Natural language processing (NLP), Computer applications to medicine. Medical informatics, R858-859.7, Science (General), Q1-390
الوصف: In this paper, we present a modern standard Arabic dataset based on Arabic news articles collected over a one-year period from 01/01/2021 to 12/31/2021. In total, from 12 Arabic news websites, over 500,000 articles were collected, the selection of which was driven by a variety of topics, including sports, economies, local news, politics, tech, tourism, entertainment, cars, health, and art. The development of this dataset will enable data scientists to explore and experiment effectively in the field of natural language processing, and the dataset can also be used to develop machine learning and deep learning models to classify articles according to topic. The dataset is available for download athttps://github.com/alaybaa/ArabicArticlesDataset/tree/main.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2352-3409
Relation: http://www.sciencedirect.com/science/article/pii/S2352340923005607; https://doaj.org/toc/2352-3409
DOI: 10.1016/j.dib.2023.109460
URL الوصول: https://doaj.org/article/ab153bffb8964010a788149ca0dec8f0
رقم الأكسشن: edsdoj.b153bffb8964010a788149ca0dec8f0
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:23523409
DOI:10.1016/j.dib.2023.109460