Deep Mamba Multi-modal Learning

Bibliographic Details
Title: Deep Mamba Multi-modal Learning
Authors: Zhu, Jian; Zou, Xin; Cui, Yu; Huang, Zhangmin; Hu, Chenshu; Lyu, Bo
Publication Year: 2024
Collection: Computer Science
Subject Terms: Computer Science - Multimedia
Description: Inspired by the excellent performance of Mamba networks, we propose a novel Deep Mamba Multi-modal Learning (DMML) method for fusing multi-modal features. We apply DMML to multimedia retrieval and propose a Deep Mamba Multi-modal Hashing (DMMH) method, which combines the advantages of accuracy and inference speed. We validate DMMH on three public datasets, where it achieves state-of-the-art results.
Comment: Deep Mamba Multi-modal Learning; Deep Mamba Multi-modal Hashing
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2406.18007
Accession Number: edsarx.2406.18007
Database: arXiv