A Resource-Efficient Embedded Iris Recognition System Using Fully Convolutional Networks

التفاصيل البيبلوغرافية
العنوان: A Resource-Efficient Embedded Iris Recognition System Using Fully Convolutional Networks
المؤلفون: Heng Zhao, Sherief Reda, Hokchhay Tann
المصدر: ACM Journal on Emerging Technologies in Computing Systems. 16:1-23
بيانات النشر: Association for Computing Machinery (ACM), 2019.
سنة النشر: 2019
مصطلحات موضوعية: FOS: Computer and information sciences, Speedup, Computer science, Computer Vision and Pattern Recognition (cs.CV), Pipeline (computing), Iris recognition, Computer Science - Computer Vision and Pattern Recognition, 02 engineering and technology, Encoding (memory), FOS: Electrical engineering, electronic engineering, information engineering, 0202 electrical engineering, electronic engineering, information engineering, Neural and Evolutionary Computing (cs.NE), Electrical and Electronic Engineering, Field-programmable gate array, Quantization (image processing), business.industry, Deep learning, Image and Video Processing (eess.IV), Computer Science - Neural and Evolutionary Computing, Electrical Engineering and Systems Science - Image and Video Processing, 020202 computer hardware & architecture, Computer engineering, Hardware and Architecture, Hardware acceleration, 020201 artificial intelligence & image processing, Artificial intelligence, business, Software
الوصف: Applications of fully convolutional networks (FCN) in iris segmentation have shown promising advances. For mobile and embedded systems, a significant challenge is that the proposed FCN architectures are extremely computationally demanding. In this article, we propose a resource-efficient, end-to-end iris recognition flow, which consists of FCN-based segmentation and a contour fitting module, followed by Daugman normalization and encoding. To attain accurate and efficient FCN models, we propose a three-step SW/HW co-design methodology consisting of FCN architectural exploration, precision quantization, and hardware acceleration. In our exploration, we propose multiple FCN models, and in comparison to previous works, our best-performing model requires 50× fewer floating-point operations per inference while achieving a new state-of-the-art segmentation accuracy. Next, we select the most efficient set of models and further reduce their computational complexity through weights and activations quantization using an 8-bit dynamic fixed-point format. Each model is then incorporated into an end-to-end flow for true recognition performance evaluation. A few of our end-to-end pipelines outperform the previous state of the art on two datasets evaluated. Finally, we propose a novel dynamic fixed-point accelerator and fully demonstrate the SW/HW co-design realization of our flow on an embedded FPGA platform. In comparison with the embedded CPU, our hardware acceleration achieves up to 8.3× speedup for the overall pipeline while using less than 15% of the available FPGA resources. We also provide comparisons between the FPGA system and an embedded GPU showing different benefits and drawbacks for the two platforms.
تدمد: 1550-4840
1550-4832
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f3a0bf626c4005969d6e78656f4fa72c
https://doi.org/10.1145/3357796
حقوق: OPEN
رقم الأكسشن: edsair.doi.dedup.....f3a0bf626c4005969d6e78656f4fa72c
قاعدة البيانات: OpenAIRE