A 47.4µJ/epoch Trainable Deep Convolutional Neural Network Accelerator for In-Situ Personalization on Smart Devices

التفاصيل البيبلوغرافية
العنوان: A 47.4µJ/epoch Trainable Deep Convolutional Neural Network Accelerator for In-Situ Personalization on Smart Devices
المؤلفون: Seungkyu Choi, Jaehyeong Sim, Myeonggu Kang, Lee-Sup Kim, Yeongjae Choi, Hyeonuk Kim
المصدر: A-SSCC
بيانات النشر: IEEE, 2019.
سنة النشر: 2019
مصطلحات موضوعية: 010302 applied physics, Lossless compression, Multi-core processor, Dataflow, business.industry, Computer science, Deep learning, Computation, 02 engineering and technology, 01 natural sciences, Convolutional neural network, 020202 computer hardware & architecture, Computational science, 0103 physical sciences, Scalability, Datapath, 0202 electrical engineering, electronic engineering, information engineering, Artificial intelligence, business
الوصف: A scalable deep learning accelerator supporting both inference and training is implemented for device personalization of deep convolutional neural networks. It consists of three processor cores operating with distinct energy-efficient dataflow for different types of computation in CNN training. Two cores conduct forward and backward propagation in convolutional layers and utilize a masking scheme to reduce 88.3% of intermediate data to store for training. The third core executes weight update process in convolutional layers and inner product computation in fully connected layers with a novel large window dataflow. The system enables 8-bit fixed point datapath with lossless training and consumes $47.4\mu \mathrm{J}/\mathrm{epoch}$ for a customized deep CNN model.
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_________::e170d8ea8d8abe65dee209edf8eb5923
https://doi.org/10.1109/a-sscc47793.2019.9056972
حقوق: CLOSED
رقم الأكسشن: edsair.doi...........e170d8ea8d8abe65dee209edf8eb5923
قاعدة البيانات: OpenAIRE