Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data

التفاصيل البيبلوغرافية
العنوان: Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data
المؤلفون: Ahmad Kamran Malik, Ahmad Raza Shahid, Yousra Asim, Nafees Qamar, Basit Raza
المصدر: IEEE Access. 9:6836-6854
بيانات النشر: Institute of Electrical and Electronics Engineers (IEEE), 2021.
سنة النشر: 2021
مصطلحات موضوعية: General Computer Science, Computer science, business.industry, General Engineering, Context (language use), Unstructured data, 02 engineering and technology, Machine learning, computer.software_genre, Random forest, Identification (information), Statistical classification, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, General Materials Science, Case-based reasoning, Artificial intelligence, business, computer
الوصف: Bloggers possess the capability of understanding and influencing mass psychology to a wide community of fans and followers by posting their online valuable content. Their dominance over audience can be used as a helping hand in the corporate world which desires to disseminate their product or services among diversified people belonging to varying localities, and is always on the lookout for suitable and quick ways to grasp public access. Due to this reason, influential bloggers are preferred in the online market to initiate marketing campaigns which is a thought-provoking task due to loads of blogger communities. The novelty of this paper lies in the proposed Framework for Influential Blogger Prediction based on Blogger and Blog Features (IBP-BBF) using Case-Based Reasoning (CBR) which is not only capable of handling labeled data but also unstructured data (blogs) and imbalanced data in an optimized way. Detailed labelled and unstructured data are collected by online survey of 129 bloggers and text mining of their 32,200 blogs respectively. The classification results are compared and validated with state-of-the-art machine learning techniques by using standard evaluation measures respectively in the context of imbalanced data. The results show that the proposed IBP-BBF framework through CBR modeling outperforms existing techniques in classifying and adapting the influential blogger prediction. The IBP-BBF framework performed better as compared to baseline imbalanced data classification techniques. It is found that the Balanced Random Forest contributes towards the performance of CBR approach than Balanced Bagging Classifier and RUSBoost classifier. By using the CBR approach, baseline techniques can be optimized for influential blogger identification in a better way.
تدمد: 2169-3536
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_________::1636cd0f1bcf0b85b342f04c0bb6560b
https://doi.org/10.1109/access.2020.3048610
حقوق: OPEN
رقم الأكسشن: edsair.doi...........1636cd0f1bcf0b85b342f04c0bb6560b
قاعدة البيانات: OpenAIRE