Vision Transformers are Parameter-Efficient Audio-Visual Learners

التفاصيل البيبلوغرافية
العنوان: Vision Transformers are Parameter-Efficient Audio-Visual Learners
المؤلفون: Lin, Yan-Bo, Sung, Yi-Lin, Lei, Jie, Bansal, Mohit, Bertasius, Gedas
المصدر: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) CVPR Computer Vision and Pattern Recognition (CVPR), 2023 IEEE/CVF Conference on. :2299-2309 Jun, 2023
Relation: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
قاعدة البيانات: IEEE Xplore Digital Library
الوصف
ردمك:9798350301298
تدمد:25757075
DOI:10.1109/CVPR52729.2023.00228