Academic Journal

Nighttime vehicle detection algorithm based on image translation technology.

Bibliographic Details
Title: Nighttime vehicle detection algorithm based on image translation technology.
Authors: Wu, Yixun; Wang, Taiyu; Gu, Runze; Liu, Chao; Xu, Boqiang
Source: Journal of Intelligent & Fuzzy Systems; 2024, Vol. 46 Issue 2, p5377-5389, 13p
Subject Terms: OBJECT recognition (Computer vision), INTELLIGENT transportation systems, GENERATIVE adversarial networks, COMPUTER vision, COMPARATIVE method, VIDEO monitors
Abstract: In order to address the problem of decreased accuracy in vehicle object detection models when facing low-light conditions in nighttime environments, this paper proposes a method to enhance the accuracy and precision of object detection by using the image translation technology based on the Generative Adversarial Network (GAN) in the field of computer vision, specifically the CycleGAN, from the perspective of improving the training set of object detection models. This is achieved by transforming the existing well-established daytime vehicle dataset into a nighttime vehicle dataset. The proposed method adopts a comparative experimental approach to obtain translation models with different degrees of fitting by changing the training set capacity, and selects the optimal model based on the evaluation of the effect. The translated dataset is then used to train the YOLO-v5-based object detection model, and the quality of the nighttime dataset is evaluated through the evaluation of annotation confidence and effectiveness. The research results indicate that utilizing the translated nighttime vehicle dataset for training the object detection model can increase the area under the PR curve and the peak F1 score by 10.4% and 9% respectively. This approach improves the annotation accuracy and precision of vehicle object detection models in nighttime environments without requiring additional labeling of vehicles in monitoring videos. [ABSTRACT FROM AUTHOR]
Copyright of Journal of Intelligent & Fuzzy Systems is the property of IOS Press and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Complementary Index
Description
ISSN:10641246
DOI:10.3233/JIFS-233899