Lightweight Deep Neural Network Accelerators Using Approximate SW/HW Techniques

التفاصيل البيبلوغرافية
العنوان: Lightweight Deep Neural Network Accelerators Using Approximate SW/HW Techniques
المؤلفون: Hokchhay Tann, Sherief Reda, Soheil Hashemi
المصدر: Approximate Circuits ISBN: 9783319993218
Approximate Circuits
بيانات النشر: Springer International Publishing, 2018.
سنة النشر: 2018
مصطلحات موضوعية: Artificial neural network, Computer science, Design flow, Latency (audio), Process (computing), Ranging, 02 engineering and technology, 020202 computer hardware & architecture, Reduction (complexity), Computer engineering, Limit (music), Binary data, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing
الوصف: Deep neural networks (DNNs) provide state-of-the-art accuracy performances in many application domains, such as computer vision and speech recognition. At the same time, DNNs require millions of expensive floating-point operations to process each input, which limit their applicability to resource-constrained systems that are limited in hardware design area or power consumption. Our goal is to devise lightweight, approximate accelerators for DNN accelerations that use less hardware resources with negligible reduction in accuracy. To simplify the hardware requirements, we analyze a spectrum of data precision methods ranging from fixed-point, dynamic fixed-point, powers-of-two to binary data precision. In conjunction, we provide new training methods to compensate for the simpler hardware resources. To boost the accuracy of the proposed lightweight accelerators, we describe ensemble processing techniques that use an ensemble of lightweight DNN accelerators to achieve the same or better accuracy than the original floating-point accelerator, while still using much less hardware resources. Using 65 nm technology libraries and industrial-strength design flow, we demonstrate a custom hardware accelerator design and training procedure which achieve low-power, low-latency while incurring insignificant accuracy degradation. We evaluate our design and technique on the CIFAR-10 and ImageNet datasets and show that significant reduction in power and inference latency is realized.
ردمك: 978-3-319-99321-8
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_________::817e638360d3aa40d360c3b4b3cedf77
https://doi.org/10.1007/978-3-319-99322-5_14
حقوق: CLOSED
رقم الأكسشن: edsair.doi...........817e638360d3aa40d360c3b4b3cedf77
قاعدة البيانات: OpenAIRE