دورية أكاديمية

PointMapNet: Point Cloud Feature Map Network for 3D Human Action Recognition

التفاصيل البيبلوغرافية
العنوان: PointMapNet: Point Cloud Feature Map Network for 3D Human Action Recognition
المؤلفون: Xing Li, Qian Huang, Yunfei Zhang, Tianjin Yang, Zhijian Wang
المصدر: Symmetry, Vol 15, Iss 2, p 363 (2023)
بيانات النشر: MDPI AG, 2023.
سنة النشر: 2023
المجموعة: LCC:Mathematics
مصطلحات موضوعية: 3D human action recognition, point cloud sequence, point cloud feature map, Mathematics, QA1-939
الوصف: 3D human action recognition is crucial in broad industrial application scenarios such as robotics, video surveillance, autonomous driving, or intellectual education, etc. In this paper, we present a new point cloud sequence network called PointMapNet for 3D human action recognition. In PointMapNet, two point cloud feature maps symmetrical to depth feature maps are proposed to summarize appearance and motion representations from point cloud sequences. Specifically, we first convert the point cloud frames to virtual action frames using static point cloud techniques. The virtual action frame is a 1D vector used to characterize the structural details in the point cloud frame. Then, inspired by feature map-based human action recognition on depth sequences, two point cloud feature maps are symmetrically constructed to recognize human action from the point cloud sequence, i.e., Point Cloud Appearance Map (PCAM) and Point Cloud Motion Map (PCMM). To construct PCAM, an MLP-like network architecture is designed and used to capture the spatio-temporal appearance feature of the human action in a virtual action sequence. To construct PCMM, the MLP-like network architecture is used to capture the motion feature of the human action in a virtual action difference sequence. Finally, the two point cloud feature map descriptors are concatenated and fed to a fully connected classifier for human action recognition. In order to evaluate the performance of the proposed approach, extensive experiments are conducted. The proposed method achieves impressive results on three benchmark datasets, namely NTU RGB+D 60 (89.4% cross-subject and 96.7% cross-view), UTD-MHAD (91.61%), and MSR Action3D (91.91%). The experimental results outperform existing state-of-the-art point cloud sequence classification networks, demonstrating the effectiveness of our method.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 15020363
2073-8994
Relation: https://www.mdpi.com/2073-8994/15/2/363; https://doaj.org/toc/2073-8994
DOI: 10.3390/sym15020363
URL الوصول: https://doaj.org/article/f499269bfb644de39cdc78ed976463d1
رقم الأكسشن: edsdoj.f499269bfb644de39cdc78ed976463d1
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:15020363
20738994
DOI:10.3390/sym15020363