Vision Transformer with 2D Explicit Position Encoding

التفاصيل البيبلوغرافية
العنوان: Vision Transformer with 2D Explicit Position Encoding
المؤلفون: Li, Yujie, Ma, Zihang, Wang, Xinghe, Wang, Yifu, Tan, Benying
المصدر: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Acoustics, Speech and Signal Processing (ICASSP), ICASSP 2024 - 2024 IEEE International Conference on. :7690-7694 Apr, 2024
Relation: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
قاعدة البيانات: IEEE Xplore Digital Library
الوصف
ردمك:9798350344851
تدمد:2379190X
DOI:10.1109/ICASSP48485.2024.10446293