Revisiting the Disequilibrium Issues in Tackling Heart Disease Classification Tasks

التفاصيل البيبلوغرافية
العنوان: Revisiting the Disequilibrium Issues in Tackling Heart Disease Classification Tasks
المؤلفون: Hoang, Thao, Nguyen, Linh, Do, Khoi, Nguyen, Duong, Nguyen, Viet Dung
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Machine Learning, Electrical Engineering and Systems Science - Signal Processing
الوصف: In the field of heart disease classification, two primary obstacles arise. Firstly, existing Electrocardiogram (ECG) datasets consistently demonstrate imbalances and biases across various modalities. Secondly, these time-series data consist of diverse lead signals, causing Convolutional Neural Networks (CNNs) to become overfitting to the one with higher power, hence diminishing the performance of the Deep Learning (DL) process. In addition, when facing an imbalanced dataset, performance from such high-dimensional data may be susceptible to overfitting. Current efforts predominantly focus on enhancing DL models by designing novel architectures, despite these evident challenges, seemingly overlooking the core issues, therefore hindering advancements in heart disease classification. To address these obstacles, our proposed approach introduces two straightforward and direct methods to enhance the classification tasks. To address the high dimensionality issue, we employ a Channel-wise Magnitude Equalizer (CME) on signal-encoded images. This approach reduces redundancy in the feature data range, highlighting changes in the dataset. Simultaneously, to counteract data imbalance, we propose the Inverted Weight Logarithmic Loss (IWL) to alleviate imbalances among the data. When applying IWL loss, the accuracy of state-of-the-art models (SOTA) increases up to 5% in the CPSC2018 dataset. CME in combination with IWL also surpasses the classification results of other baseline models from 5% to 10%.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2407.20249
رقم الأكسشن: edsarx.2407.20249
قاعدة البيانات: arXiv