Maximizing system performance by balancing computation loads in LSTM accelerators

التفاصيل البيبلوغرافية
العنوان: Maximizing system performance by balancing computation loads in LSTM accelerators
المؤلفون: Jaeha Kung, Junki Park, Wooseok Yi, Jae-Joon Kim
المصدر: DATE
بيانات النشر: IEEE, 2018.
سنة النشر: 2018
مصطلحات موضوعية: Artificial neural network, Computer science, business.industry, Deep learning, Computation, 020208 electrical & electronic engineering, 02 engineering and technology, 020202 computer hardware & architecture, Computer engineering, 0202 electrical engineering, electronic engineering, information engineering, Overhead (computing), Multiplication, Artificial intelligence, Pruning (decision trees), business, Sparse matrix
الوصف: The LSTM is a popular neural network model for modeling or analyzing the time-varying data. The main operation of LSTM is a matrix-vector multiplication and it becomes sparse (spMxV) due to the widely-accepted weight pruning in deep learning. This paper presents a new sparse matrix format, named CBSR, to maximize the inference speed of the LSTM accelerator. In the CBSR format, speed-up is achieved by balancing out the computation loads over PEs. Along with the new format, we present a simple network transformation to completely remove the hardware overhead incurred when using the CBSR format. Also, the detailed analysis on the impact of network size or the number of PEs is performed, which lacks in the prior work. The simulation results show 16∼38% improvement in the system performance compared to the well-known CSC/CSR format. The power analysis is also performed in 65nm CMOS technology to show 9∼22% energy savings.
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_________::6db0bac58a9335cbad406467284b081b
https://doi.org/10.23919/date.2018.8341971
رقم الأكسشن: edsair.doi...........6db0bac58a9335cbad406467284b081b
قاعدة البيانات: OpenAIRE