Progressive Evolution from Single-Point to Polygon for Scene Text

التفاصيل البيبلوغرافية
العنوان: Progressive Evolution from Single-Point to Polygon for Scene Text
المؤلفون: Deng, Linger, Huang, Mingxin, Xie, Xudong, Liu, Yuliang, Jin, Lianwen, Bai, Xiang
سنة النشر: 2023
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition
الوصف: The advancement of text shape representations towards compactness has enhanced text detection and spotting performance, but at a high annotation cost. Current models use single-point annotations to reduce costs, yet they lack sufficient localization information for downstream applications. To overcome this limitation, we introduce Point2Polygon, which can efficiently transform single-points into compact polygons. Our method uses a coarse-to-fine process, starting with creating and selecting anchor points based on recognition confidence, then vertically and horizontally refining the polygon using recognition information to optimize its shape. We demonstrate the accuracy of the generated polygons through extensive experiments: 1) By creating polygons from ground truth points, we achieved an accuracy of 82.0% on ICDAR 2015; 2) In training detectors with polygons generated by our method, we attained 86% of the accuracy relative to training with ground truth (GT); 3) Additionally, the proposed Point2Polygon can be seamlessly integrated to empower single-point spotters to generate polygons. This integration led to an impressive 82.5% accuracy for the generated polygons. It is worth mentioning that our method relies solely on synthetic recognition information, eliminating the need for any manual annotation beyond single points.
Comment: Accepted in ICDAR 2024
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2312.13778
رقم الأكسشن: edsarx.2312.13778
قاعدة البيانات: arXiv