DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

التفاصيل البيبلوغرافية
العنوان: DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
المؤلفون: Wang, Yibo, Gao, Ruiyuan, Chen, Kai, Zhou, Kaiqiang, Cai, Yingjie, Hong, Lanqing, Li, Zhenguo, Jiang, Lihui, Yeung, Dit-Yan, Xu, Qiang, Zhang, Kai
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition
الوصف: Current perceptive models heavily depend on resource-intensive datasets, prompting the need for innovative solutions. Leveraging recent advances in diffusion models, synthetic data, by constructing image inputs from various annotations, proves beneficial for downstream tasks. While prior methods have separately addressed generative and perceptive models, DetDiffusion, for the first time, harmonizes both, tackling the challenges in generating effective data for perceptive models. To enhance image generation with perceptive models, we introduce perception-aware loss (P.A. loss) through segmentation, improving both quality and controllability. To boost the performance of specific perceptive models, our method customizes data augmentation by extracting and utilizing perception-aware attribute (P.A. Attr) during generation. Experimental results from the object detection task highlight DetDiffusion's superior performance, establishing a new state-of-the-art in layout-guided generation. Furthermore, image syntheses from DetDiffusion can effectively augment training data, significantly enhancing downstream detection performance.
Comment: Accepted to CVPR 2024
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2403.13304
رقم الأكسشن: edsarx.2403.13304
قاعدة البيانات: arXiv