دورية أكاديمية

A graph convolutional neural network model with Fisher vector encoding and channel‐wise spatial‐temporal aggregation for skeleton‐based action recognition

التفاصيل البيبلوغرافية
العنوان: A graph convolutional neural network model with Fisher vector encoding and channel‐wise spatial‐temporal aggregation for skeleton‐based action recognition
المؤلفون: Jun Tang, Yanjiang Wang, Sichao Fu, Baodi Liu, Weifeng Liu
المصدر: IET Image Processing, Vol 16, Iss 5, Pp 1433-1443 (2022)
بيانات النشر: Wiley, 2022.
سنة النشر: 2022
المجموعة: LCC:Computer software
مصطلحات موضوعية: Photography, TR1-1050, Computer software, QA76.75-76.765
الوصف: Abstract Skeleton‐based action recognition is an inspired yet challenging task in computer vision. Recently, the latest graph convolutional network (GCN), which generalises well‐established convolutional neural networks to non‐Euclidean structures, is proven to be highly successful for action recognition from body skeleton data. However, the GCN architecture has not been fully studied. In this work, a Fisher vector (FV) encoding based GCN architecture (FV‐GCN) is proposed, which exceeds the limitations of existing GCN‐based methods by combining the GCN model with FV encoding. A channel‐wise spatial–temporal aggregation function to preserve spatial–temporal information in the whole action clip and integrate it into the FV‐GCN architecture is also presented. Since FV is different from the GCN structure, this hybrid architecture that incorporates the advantages of both algorithms can discover complementary information of feature representation effectively. On two challenging human action datasets, kinetics, and NTU‐RGBD, improved performance is demonstrated over the baseline method, and the FV‐GCN is better or comparable to some state‐of‐the‐art methods.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 1751-9667
1751-9659
Relation: https://doaj.org/toc/1751-9659; https://doaj.org/toc/1751-9667
DOI: 10.1049/ipr2.12422
URL الوصول: https://doaj.org/article/0aa9c3c801774023bbb4f1c6aaa7be7b
رقم الأكسشن: edsdoj.0aa9c3c801774023bbb4f1c6aaa7be7b
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:17519667
17519659
DOI:10.1049/ipr2.12422