Context-PIPs: Persistent Independent Particles Demands Spatial Context Features

التفاصيل البيبلوغرافية
العنوان: Context-PIPs: Persistent Independent Particles Demands Spatial Context Features
المؤلفون: Bian, Weikang, Huang, Zhaoyang, Shi, Xiaoyu, Dong, Yitong, Li, Yijin, Li, Hongsheng
سنة النشر: 2023
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition
الوصف: We tackle the problem of Persistent Independent Particles (PIPs), also called Tracking Any Point (TAP), in videos, which specifically aims at estimating persistent long-term trajectories of query points in videos. Previous methods attempted to estimate these trajectories independently to incorporate longer image sequences, therefore, ignoring the potential benefits of incorporating spatial context features. We argue that independent video point tracking also demands spatial context features. To this end, we propose a novel framework Context-PIPs, which effectively improves point trajectory accuracy by aggregating spatial context features in videos. Context-PIPs contains two main modules: 1) a SOurse Feature Enhancement (SOFE) module, and 2) a TArget Feature Aggregation (TAFA) module. Context-PIPs significantly improves PIPs all-sided, reducing 11.4% Average Trajectory Error of Occluded Points (ATE-Occ) on CroHD and increasing 11.8% Average Percentage of Correct Keypoint (A-PCK) on TAP-Vid-Kinectics. Demos are available at https://wkbian.github.io/Projects/Context-PIPs/.
Comment: Project Page: https://wkbian.github.io/Projects/Context-PIPs/
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2306.02000
رقم الأكسشن: edsarx.2306.02000
قاعدة البيانات: arXiv