Improving Multi-Person Pose Tracking with A Confidence Network

التفاصيل البيبلوغرافية
العنوان: Improving Multi-Person Pose Tracking with A Confidence Network
المؤلفون: Fu, Zehua, Zuo, Wenhang, Hu, Zhenghui, Liu, Qingjie, Wang, Yunhong
سنة النشر: 2023
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition
الوصف: Human pose estimation and tracking are fundamental tasks for understanding human behaviors in videos. Existing top-down framework-based methods usually perform three-stage tasks: human detection, pose estimation and tracking. Although promising results have been achieved, these methods rely heavily on high-performance detectors and may fail to track persons who are occluded or miss-detected. To overcome these problems, in this paper, we develop a novel keypoint confidence network and a tracking pipeline to improve human detection and pose estimation in top-down approaches. Specifically, the keypoint confidence network is designed to determine whether each keypoint is occluded, and it is incorporated into the pose estimation module. In the tracking pipeline, we propose the Bbox-revision module to reduce missing detection and the ID-retrieve module to correct lost trajectories, improving the performance of the detection stage. Experimental results show that our approach is universal in human detection and pose estimation, achieving state-of-the-art performance on both PoseTrack 2017 and 2018 datasets.
Comment: Accepted by IEEE Transactions on Multimedia. 11 pages, 5 figures
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2310.18920
رقم الأكسشن: edsarx.2310.18920
قاعدة البيانات: arXiv