Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation

التفاصيل البيبلوغرافية
العنوان: Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation
المؤلفون: Chu, Tianyi, Xing, Wei, Chen, Jiafu, Wang, Zhizhong, Sun, Jiakai, Zhao, Lei, Chen, Haibo, Lin, Huaizhong
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition
الوصف: Existing generative adversarial network (GAN) based conditional image generative models typically produce fixed output for the same conditional input, which is unreasonable for highly subjective tasks, such as large-mask image inpainting or style transfer. On the other hand, GAN-based diverse image generative methods require retraining/fine-tuning the network or designing complex noise injection functions, which is computationally expensive, task-specific, or struggle to generate high-quality results. Given that many deterministic conditional image generative models have been able to produce high-quality yet fixed results, we raise an intriguing question: is it possible for pre-trained deterministic conditional image generative models to generate diverse results without changing network structures or parameters? To answer this question, we re-examine the conditional image generation tasks from the perspective of adversarial attack and propose a simple and efficient plug-in projected gradient descent (PGD) like method for diverse and controllable image generation. The key idea is attacking the pre-trained deterministic generative models by adding a micro perturbation to the input condition. In this way, diverse results can be generated without any adjustment of network structures or fine-tuning of the pre-trained models. In addition, we can also control the diverse results to be generated by specifying the attack direction according to a reference text or image. Our work opens the door to applying adversarial attack to low-level vision tasks, and experiments on various conditional image generation tasks demonstrate the effectiveness and superiority of the proposed method.
Comment: 9 pages, 7 figures, accepted by AAAI24
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2403.08294
رقم الأكسشن: edsarx.2403.08294
قاعدة البيانات: arXiv