دورية أكاديمية

ANNETTE: Accurate Neural Network Execution Time Estimation With Stacked Models

التفاصيل البيبلوغرافية
العنوان: ANNETTE: Accurate Neural Network Execution Time Estimation With Stacked Models
المؤلفون: Matthias Wess, Matvey Ivanov, Christoph Unger, Anvesh Nookala, Alexander Wendt, Axel Jantsch
المصدر: IEEE Access, Vol 9, Pp 3545-3556 (2021)
بيانات النشر: IEEE, 2021.
سنة النشر: 2021
المجموعة: LCC:Electrical engineering. Electronics. Nuclear engineering
مصطلحات موضوعية: Analytical models, estimation, neural network hardware, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
الوصف: With new accelerator hardware for Deep Neural Networks (DNNs), the computing power for Artificial Intelligence (AI) applications has increased rapidly. However, as DNN algorithms become more complex and optimized for specific applications, latency requirements remain challenging, and it is critical to find the optimal points in the design space. To decouple the architectural search from the target hardware, we propose a time estimation framework that allows for modeling the inference latency of DNNs on hardware accelerators based on mapping and layer-wise estimation models. The proposed methodology extracts a set of models from micro-kernel and multi-layer benchmarks and generates a stacked model for mapping and network execution time estimation. We compare estimation accuracy and fidelity of the generated mixed models, statistical models with the roofline model, and a refined roofline model for evaluation. We test the mixed models on the ZCU102 SoC board with Xilinx Deep Neural Network Development Kit (DNNDK) and Intel Neural Compute Stick 2 (NCS2) on a set of 12 state-of-the-art neural networks. It shows an average estimation error of 3.47% for the DNNDK and 7.44% for the NCS2, outperforming the statistical and analytical layer models for almost all selected networks. For a randomly selected subset of 34 networks of the NASBench dataset, the mixed model reaches fidelity of 0.988 in Spearman’s $\rho $ rank correlation coefficient metric.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2169-3536
Relation: https://ieeexplore.ieee.org/document/9306831/; https://doaj.org/toc/2169-3536
DOI: 10.1109/ACCESS.2020.3047259
URL الوصول: https://doaj.org/article/1546e71ebabf4c1381e2e6170570a848
رقم الأكسشن: edsdoj.1546e71ebabf4c1381e2e6170570a848
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:21693536
DOI:10.1109/ACCESS.2020.3047259