The curious case of the test set AUROC

التفاصيل البيبلوغرافية
العنوان: The curious case of the test set AUROC
المؤلفون: Roberts, Michael, Hazan, Alon, Dittmer, Sören, Rudd, James H. F., Schönlieb, Carola-Bibiane
سنة النشر: 2023
المجموعة: Computer Science
Statistics
مصطلحات موضوعية: Computer Science - Machine Learning, Statistics - Methodology
الوصف: Whilst the size and complexity of ML models have rapidly and significantly increased over the past decade, the methods for assessing their performance have not kept pace. In particular, among the many potential performance metrics, the ML community stubbornly continues to use (a) the area under the receiver operating characteristic curve (AUROC) for a validation and test cohort (distinct from training data) or (b) the sensitivity and specificity for the test data at an optimal threshold determined from the validation ROC. However, we argue that considering scores derived from the test ROC curve alone gives only a narrow insight into how a model performs and its ability to generalise.
Comment: 3 pages, 4 figures
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2312.16188
رقم الأكسشن: edsarx.2312.16188
قاعدة البيانات: arXiv