Masked Diffusion as Self-supervised Representation Learner

التفاصيل البيبلوغرافية
العنوان: Masked Diffusion as Self-supervised Representation Learner
المؤلفون: Pan, Zixuan, Chen, Jianxu, Shi, Yiyu
سنة النشر: 2023
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition
الوصف: Denoising diffusion probabilistic models have recently demonstrated state-of-the-art generative performance and have been used as strong pixel-level representation learners. This paper decomposes the interrelation between the generative capability and representation learning ability inherent in diffusion models. We present the masked diffusion model (MDM), a scalable self-supervised representation learner for semantic segmentation, substituting the conventional additive Gaussian noise of traditional diffusion with a masking mechanism. Our proposed approach convincingly surpasses prior benchmarks, demonstrating remarkable advancements in both medical and natural image semantic segmentation tasks, particularly in few-shot scenarios.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2308.05695
رقم الأكسشن: edsarx.2308.05695
قاعدة البيانات: arXiv