Image Aesthetics Assessment via Learnable Queries

التفاصيل البيبلوغرافية
العنوان: Image Aesthetics Assessment via Learnable Queries
المؤلفون: Xiong, Zhiwei, Zhang, Yunfan, Shen, Zhiqi, Ren, Peiran, Yu, Han
سنة النشر: 2023
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition
الوصف: Image aesthetics assessment (IAA) aims to estimate the aesthetics of images. Depending on the content of an image, diverse criteria need to be selected to assess its aesthetics. Existing works utilize pre-trained vision backbones based on content knowledge to learn image aesthetics. However, training those backbones is time-consuming and suffers from attention dispersion. Inspired by learnable queries in vision-language alignment, we propose the Image Aesthetics Assessment via Learnable Queries (IAA-LQ) approach. It adapts learnable queries to extract aesthetic features from pre-trained image features obtained from a frozen image encoder. Extensive experiments on real-world data demonstrate the advantages of IAA-LQ, beating the best state-of-the-art method by 2.2% and 2.1% in terms of SRCC and PLCC, respectively.
Comment: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2309.02861
رقم الأكسشن: edsarx.2309.02861
قاعدة البيانات: arXiv