Mutual Support of Data Modalities in the Task of Sign Language Recognition

التفاصيل البيبلوغرافية
العنوان: Mutual Support of Data Modalities in the Task of Sign Language Recognition
المؤلفون: Ivan Gruber, Zdenek Krnoul, Marek Hrúz, Matyas Bohacek, Jakub Kanis
المصدر: CVPR Workshops
بيانات النشر: IEEE, 2021.
سنة النشر: 2021
مصطلحات موضوعية: Scheme (programming language), Modalities, Computer science, business.industry, sign language recognition, Pattern recognition, Sign language, Semantics, konvoluční neuronové sítě, Visualization, Gesture recognition, convolutional neural networks, RGB color model, Artificial intelligence, rozpoznávání znakového jazyka, business, computer, Transformer (machine learning model), computer.programming_language
الوصف: This paper presents a method for automatic sign language recognition that was utilized in the CVPR 2021 ChaLearn Challenge (RGB track). Our method is composed of several approaches combined in an ensemble scheme to perform isolated sign-gesture recognition. We combine modalities of video sample frames processed by a 3D ConvNet (I3D), with body-pose information in the form of joint locations processed by a Transformer, hand region images transformed into a semantic space, and linguistically defined locations of hands. Although the individual models perform sub-par (60% to 93% accuracy on validation data), the weighted ensemble results in 95.46% accuracy.
وصف الملف: application/pdf
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7313864f15e232a39ed75207c57f8c2e
https://doi.org/10.1109/cvprw53098.2021.00381
حقوق: OPEN
رقم الأكسشن: edsair.doi.dedup.....7313864f15e232a39ed75207c57f8c2e
قاعدة البيانات: OpenAIRE