دورية أكاديمية

Automatic detection method for tobacco beetles combining multi-scale global residual feature pyramid network and dual-path deformable attention

التفاصيل البيبلوغرافية
العنوان: Automatic detection method for tobacco beetles combining multi-scale global residual feature pyramid network and dual-path deformable attention
المؤلفون: Yuling Chen, Xiaoxia Li, Nianzu Lv, Zhenxiang He, Bin Wu
المصدر: Scientific Reports, Vol 14, Iss 1, Pp 1-16 (2024)
بيانات النشر: Nature Portfolio, 2024.
سنة النشر: 2024
المجموعة: LCC:Medicine
LCC:Science
مصطلحات موضوعية: Tobacco beetle, Small object, Tobacco beetle dataset, Deformable convolutional, Attention mechanism, Feature pyramid network, Medicine, Science
الوصف: Abstract Aiming at the problems of identifying storage pest tobacco pest beetles from images that have few object pixels and considerable image noise, and therefore suffer from lack of information and identifiable features, this paper proposes an automatic monitoring method of tobacco beetle based on Multi-scale Global residual Feature Pyramid Network and Dual-path Deformable Attention (MGrFPN-DDrGAM). Firstly, a Multi-scale Global residual Feature Pyramid Network (MGrFPN) is constructed to obtain rich high-level semantic features and more complete information on low-level features to reduce missed detection; Then, a Dual-path Deformable receptive field Guided Attention Module (DDrGAM) is designed to establish long-range channel dependence, guide the effective fusion of features and improve the localization accuracy of tobacco beetles by fitting the spatial geometric deformation features of and capturing the spatial information of feature maps with different scales to enrich the feature information in the channel and spatial. Finally, to simulate a real scene, a multi-scene tobacco beetle dataset is created. The dataset includes 28,080 images and manually labeled tobacco beetle objects. The experimental results show that under the framework of the Faster R-CNN algorithm, the detection precision and recall rate of this method can reach 91.4% and 98.4% when the intersection ratio (IoU) is 0.5. Compared with Faster R-CNN and FPN, when the intersection ratio (IoU) is 0.7, the detection precision is improved by 32.9% and 6.9%, respectively. The proposed method is superior to the current mainstream methods.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2045-2322
Relation: https://doaj.org/toc/2045-2322
DOI: 10.1038/s41598-024-55347-4
URL الوصول: https://doaj.org/article/eba96718bb8c408aa6e2b13a098ad432
رقم الأكسشن: edsdoj.ba96718bb8c408aa6e2b13a098ad432
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:20452322
DOI:10.1038/s41598-024-55347-4