دورية أكاديمية

Feature Engineering and Ensemble-Based Approach for Improving Automatic Short-Answer Grading Performance

التفاصيل البيبلوغرافية
العنوان: Feature Engineering and Ensemble-Based Approach for Improving Automatic Short-Answer Grading Performance
اللغة: English
المؤلفون: Sahu, Archana (ORCID 0000-0001-8724-6158), Bhowmick, Plaban Kumar
المصدر: IEEE Transactions on Learning Technologies. Jan-Mar 2020 13(1):77-90.
الإتاحة: Institute of Electrical and Electronics Engineers, Inc. 445 Hoes Lane, Piscataway, NJ 08854. Tel: 732-981-0060; Web site: http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=4620076
Peer Reviewed: Y
Page Count: 14
تاريخ النشر: 2020
نوع الوثيقة: Journal Articles
Reports - Research
Education Level: Higher Education
Descriptors: Automation, Grading, Test Format, Artificial Intelligence, Models, Technology Uses in Education, Classification
DOI: 10.1109/TLT.2019.2897997
تدمد: 1939-1382
مستخلص: In this paper, we studied different automatic short answer grading (ASAG) systems to provide a comprehensive view of the feature spaces explored by previous works. While the performance reported in previous works have been encouraging, systematic study of the features is lacking. Apart from providing systematic feature space exploration, we also presented ensemble methods that have been experimentally validated to exhibit significantly higher grading performance over the existing papers in almost all the datasets in ASAG domain. A comparative study over different features and regression models toward short-answer grading has been performed with respect to evaluation metrics used in evaluating ASAG. Apart from traditional text similarity based features like WordNet similarity, latent semantic analysis, and others, we have introduced novel features like "topic models" suited for short text, "relevance feedback" based features. An ensemble-based model has been built using a combination of different regression models with an approach based on "stacked regression". The proposed ASAG has been tested on the University of North Texas dataset for the regression task, whereas in case of classification task, the student response analysis (SRA) based ScientsBank and Beetle corpus have been used for evaluation. The grading performance in case of ensemble-based ASAG is highly boosted from that exhibited by an individual regression model. Extensive experimentation has revealed that feature selection, introduction of novel features, and regressor stacking have been instrumental in achieving considerable improvement in performance over the existing methods in ASAG domain.
Abstractor: As Provided
Entry Date: 2020
رقم الأكسشن: EJ1248103
قاعدة البيانات: ERIC
الوصف
تدمد:1939-1382
DOI:10.1109/TLT.2019.2897997