Which Hammer Should I Use? A Systematic Evaluation of Approaches for Classifying Educational Forum Posts

التفاصيل البيبلوغرافية
العنوان: Which Hammer Should I Use? A Systematic Evaluation of Approaches for Classifying Educational Forum Posts
اللغة: English
المؤلفون: Sha, Lele, Rakovic, Mladen, Li, Yuheng, Whitelock-Wainwright, Alexander, Carroll, David, Gaševic, Dragan, Chen, Guanliang
المصدر: International Educational Data Mining Society. 2021.
الإتاحة: International Educational Data Mining Society. e-mail: admin@educationaldatamining.org; Web site: https://educationaldatamining.org/conferences/
Peer Reviewed: Y
Page Count: 12
تاريخ النشر: 2021
نوع الوثيقة: Speeches/Meeting Papers
Reports - Research
Education Level: Higher Education
Postsecondary Education
Descriptors: Classification, Computer Mediated Communication, Learning Analytics, Data Analysis, Artificial Intelligence, Models, Academic Achievement, Comparative Analysis, Online Courses, Universities, Integrated Learning Systems, Foreign Countries, College Faculty, Computer Software
مصطلحات جغرافية: Australia, California (Stanford)
مستخلص: Classifying educational forum posts is a longstanding task in the research of Learning Analytics and Educational Data Mining. Though this task has been tackled by applying both traditional Machine Learning (ML) approaches (e.g., Logistics Regression and Random Forest) and up-to-date Deep Learning (DL) approaches, there lacks a systematic examination of these two types of approaches to portray their performance difference. To better guide researchers and practitioners to select a model that suits their needs the best, this study aimed to systematically compare the effectiveness of these two types of approaches for this specific task. Specifically, we selected a total of six representative models and explored their capabilities by equipping them with either extensive input features that were widely used in previous studies (traditional ML models) or the state-of-the-art pre-trained language model BERT (DL models). Through extensive experiments on two real-world datasets (one is open-sourced), we demonstrated that: (i) DL models uniformly achieved better classification results than traditional ML models and the performance difference ranges from 1.85% to 5.32% with respect to different evaluation metrics; (ii) when applying traditional ML models, different features should be explored and engineered to tackle different classification tasks; (iii) when applying DL models, it tends to be a promising approach to adapt BERT to the specific classification task by fine-tuning its model parameters. [For the full proceedings, see ED615472.]
Abstractor: As Provided
Entry Date: 2021
رقم الأكسشن: ED615664
قاعدة البيانات: ERIC