Attentional feature pyramid network for small object detection

التفاصيل البيبلوغرافية
العنوان: Attentional feature pyramid network for small object detection
المؤلفون: Kyungseo Min, Gun-Hee Lee, Seong-Whan Lee
المصدر: Neural Networks. 155:439-450
بيانات النشر: Elsevier BV, 2022.
سنة النشر: 2022
مصطلحات موضوعية: Volatile Organic Compounds, Artificial Intelligence, Cognitive Neuroscience, Attention, Cues
الوصف: Recent state-of-the-art detectors generally exploit the Feature Pyramid Networks (FPN) due to its advantage of detecting objects at different scales. Despite significant advances in object detection owing to the design of feature pyramids, it is still challenging to detect small objects with low resolution and dense distribution in complex scenes. To address these problems, we propose Attentional Feature Pyramid Network, a new feature pyramid architecture named AFPN which consists of three components to enhance the small object detection ability, specifically: Dynamic Texture Attention, Foreground-Aware Co-Attention, and Detail Context Attention. First, Dynamic Texture Attention augments the texture features dynamically by filtering out redundant semantics to highlight small objects in lower layers and amplifying credible details to emphasize large objects in higher layers. Then, Foreground-Aware Co-Attention is explored to detect densely arranged small objects by enhancing the objects feature via foreground-correlated contexts and suppressing the background noise. Finally, to better capture the features of small objects, Detail Context Attention adaptively aggregates detail cues of RoI features with different scales for a more accurate feature representation. By substituting FPN with AFPN in Faster R-CNN, our method performs on par with the state-of-the-art performance on Tsinghua-Tencent 100K. Furthermore, we achieve highly competitive results on small category of both PASCAL VOC and MS COCO.
تدمد: 0893-6080
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::404ac57b6991ae8533817d4b402d3889
https://doi.org/10.1016/j.neunet.2022.08.029
حقوق: CLOSED
رقم الأكسشن: edsair.doi.dedup.....404ac57b6991ae8533817d4b402d3889
قاعدة البيانات: OpenAIRE