دورية أكاديمية

DiffusionFR: Species Recognition of Fish in Blurry Scenarios via Diffusion and Attention

التفاصيل البيبلوغرافية
العنوان: DiffusionFR: Species Recognition of Fish in Blurry Scenarios via Diffusion and Attention
المؤلفون: Guoying Wang, Bing Shi, Xiaomei Yi, Peng Wu, Linjun Kong, Lufeng Mo
المصدر: Animals, Vol 14, Iss 3, p 499 (2024)
بيانات النشر: MDPI AG, 2024.
سنة النشر: 2024
المجموعة: LCC:Veterinary medicine
LCC:Zoology
مصطلحات موضوعية: blurry scenarios, fish recognition, deep learning, diffusion models, Veterinary medicine, SF600-1100, Zoology, QL1-991
الوصف: Blurry scenarios, such as light reflections and water ripples, often affect the clarity and signal-to-noise ratio of fish images, posing significant challenges for traditional deep learning models in accurately recognizing fish species. Firstly, deep learning models rely on a large amount of labeled data. However, it is often difficult to label data in blurry scenarios. Secondly, existing deep learning models need to be more effective for the processing of bad, blurry, and otherwise inadequate images, which is an essential reason for their low recognition rate. A method based on the diffusion model and attention mechanism for fish image recognition in blurry scenarios, DiffusionFR, is proposed to solve these problems and improve the performance of species recognition of fish images in blurry scenarios. This paper presents the selection and application of this correcting technique. In the method, DiffusionFR, a two-stage diffusion network model, TSD, is designed to deblur bad, blurry, and otherwise inadequate fish scene pictures to restore clarity, and a learnable attention module, LAM, is intended to improve the accuracy of fish recognition. In addition, a new dataset of fish images in blurry scenarios, BlurryFish, was constructed and used to validate the effectiveness of DiffusionFR, combining bad, blurry, and otherwise inadequate images from the publicly available dataset Fish4Knowledge. The experimental results demonstrate that DiffusionFR achieves outstanding performance on various datasets. On the original dataset, DiffusionFR achieved the highest training accuracy of 97.55%, as well as a Top-1 accuracy test score of 92.02% and a Top-5 accuracy test score of 95.17%. Furthermore, on nine datasets with light reflection noise, the mean values of training accuracy reached a peak at 96.50%, while the mean values of the Top-1 accuracy test and Top-5 accuracy test were at their highest at 90.96% and 94.12%, respectively. Similarly, on three datasets with water ripple noise, the mean values of training accuracy reached a peak at 95.00%, while the mean values of the Top-1 accuracy test and Top-5 accuracy test were at their highest at 89.54% and 92.73%, respectively. These results demonstrate that the method showcases superior accuracy and enhanced robustness in handling original datasets and datasets with light reflection and water ripple noise.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2076-2615
Relation: https://www.mdpi.com/2076-2615/14/3/499; https://doaj.org/toc/2076-2615
DOI: 10.3390/ani14030499
URL الوصول: https://doaj.org/article/c7a8f8ab9cb047cdbf03d08adf99aecb
رقم الأكسشن: edsdoj.7a8f8ab9cb047cdbf03d08adf99aecb
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:20762615
DOI:10.3390/ani14030499