تقرير
Ophthalmic Biomarker Detection Using Ensembled Vision Transformers -- Winning Solution to IEEE SPS VIP Cup 2023
العنوان: | Ophthalmic Biomarker Detection Using Ensembled Vision Transformers -- Winning Solution to IEEE SPS VIP Cup 2023 |
---|---|
المؤلفون: | Shahgir, H. A. Z. Sameen, Sayeed, Khondker Salman, Zaman, Tanjeem Azwad, Haider, Md. Asif, Jony, Sheikh Saifur Rahman, Rahman, M. Sohel |
سنة النشر: | 2023 |
المجموعة: | Computer Science |
مصطلحات موضوعية: | Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition |
الوصف: | This report outlines our approach in the IEEE SPS VIP Cup 2023: Ophthalmic Biomarker Detection competition. Our primary objective in this competition was to identify biomarkers from Optical Coherence Tomography (OCT) images obtained from a diverse range of patients. Using robust augmentations and 5-fold cross-validation, we trained two vision transformer-based models: MaxViT and EVA-02, and ensembled them at inference time. We find MaxViT's use of convolution layers followed by strided attention to be better suited for the detection of local features while EVA-02's use of normal attention mechanism and knowledge distillation is better for detecting global features. Ours was the best-performing solution in the competition, achieving a patient-wise F1 score of 0.814 in the first phase and 0.8527 in the second and final phase of VIP Cup 2023, scoring 3.8% higher than the next-best solution. |
نوع الوثيقة: | Working Paper |
URL الوصول: | http://arxiv.org/abs/2310.14005 |
رقم الأكسشن: | edsarx.2310.14005 |
قاعدة البيانات: | arXiv |
كن أول من يترك تعليقا!