Joint semi-supervised and contrastive learning enables zero-shot domain-adaptation and multi-domain segmentation

التفاصيل البيبلوغرافية
العنوان: Joint semi-supervised and contrastive learning enables zero-shot domain-adaptation and multi-domain segmentation
المؤلفون: Gomariz, Alvaro, Kikuchi, Yusuke, Li, Yun Yvonna, Albrecht, Thomas, Maunz, Andreas, Ferrara, Daniela, Lu, Huanxiang, Goksel, Orcun
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
الوصف: Despite their effectiveness, current deep learning models face challenges with images coming from different domains with varying appearance and content. We introduce SegCLR, a versatile framework designed to segment volumetric images across different domains, employing supervised and contrastive learning simultaneously to effectively learn from both labeled and unlabeled data. We demonstrate the superior performance of SegCLR through a comprehensive evaluation involving three diverse clinical datasets of retinal fluid segmentation in 3D Optical Coherence Tomography (OCT), various network configurations, and verification across 10 different network initializations. In an unsupervised domain adaptation context, SegCLR achieves results on par with a supervised upper-bound model trained on the intended target domain. Notably, we discover that the segmentation performance of SegCLR framework is marginally impacted by the abundance of unlabeled data from the target domain, thereby we also propose an effective zero-shot domain adaptation extension of SegCLR, eliminating the need for any target domain information. This shows that our proposed addition of contrastive loss in standard supervised training for segmentation leads to superior models, inherently more generalizable to both in- and out-of-domain test data. We additionally propose a pragmatic solution for SegCLR deployment in realistic scenarios with multiple domains containing labeled data. Accordingly, our framework pushes the boundaries of deep-learning based segmentation in multi-domain applications, regardless of data availability - labeled, unlabeled, or nonexistent.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2405.05336
رقم الأكسشن: edsarx.2405.05336
قاعدة البيانات: arXiv