PhysMamba: Leveraging Dual-Stream Cross-Attention SSD for Remote Physiological Measurement

التفاصيل البيبلوغرافية
العنوان: PhysMamba: Leveraging Dual-Stream Cross-Attention SSD for Remote Physiological Measurement
المؤلفون: Yan, Zhixin, Zhong, Yan, Zhang, Wenjun, Shu, Lin, Xu, Hongbin, Kang, Wenxiong
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition
الوصف: Remote Photoplethysmography (rPPG) is a non-contact technique for extracting physiological signals from facial videos, used in applications like emotion monitoring, medical assistance, and anti-face spoofing. Unlike controlled laboratory settings, real-world environments often contain motion artifacts and noise, affecting the performance of existing methods. To address this, we propose PhysMamba, a dual-stream time-frequency interactive model based on Mamba. PhysMamba integrates the state-of-the-art Mamba-2 model and employs a dual-stream architecture to learn diverse rPPG features, enhancing robustness in noisy conditions. Additionally, we designed the Cross-Attention State Space Duality (CASSD) module to improve information exchange and feature complementarity between the two streams. We validated PhysMamba using PURE, UBFC-rPPG and MMPD. Experimental results show that PhysMamba achieves state-of-the-art performance across various scenarios, particularly in complex environments, demonstrating its potential in practical remote heart rate monitoring applications.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2408.01077
رقم الأكسشن: edsarx.2408.01077
قاعدة البيانات: arXiv