ENMC: Extreme Near-Memory Classification via Approximate Screening

التفاصيل البيبلوغرافية
العنوان: ENMC: Extreme Near-Memory Classification via Approximate Screening
المؤلفون: Zheng Qu, Liu Liu, Yuan Xie, Jilan Lin, Yufei Ding
المصدر: MICRO
بيانات النشر: ACM, 2021.
سنة النشر: 2021
مصطلحات موضوعية: Speedup, Computational complexity theory, Computer engineering, business.industry, Computer science, Deep learning, Component (UML), Classifier (linguistics), Artificial intelligence, Language model, Central processing unit, business, Power budget
الوصف: Extreme classification (XC) is the essential component of large-scale Deep Learning Systems for a wide range of application domains, including image recognition, language modeling, and recommendation. As classification categories keep scaling in real-world applications, the classifier’s parameters could reach several thousands of Gigabytes, way exceed the on-chip memory capacity. With the advent of near-memory processing (NMP) architectures, offloading the XC component onto NMP units could alleviate the memory-intensive problem. However, naive NMP design with limited area and power budget cannot afford the computational complexity of full classification. To tackle the problem, we first propose a novel screening method to reduce the computation and memory consumption by efficiently approximating the classification output and identifying a small portion of key candidates that require accurate results. Then, we design a new extreme-classification-tailored NMP architecture, namely ENMC, to support both screening and candidates-only classification. Overall, our approximate screening method achieves 7.3 × speedup over the CPU baseline, and ENMC further improves the performance by 7.4 × and demonstrates 2.7 × speedup compared with the state-of-the-art NMP baseline.
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_________::ce36f66ff09ccc48ca37144d4455ba15
https://doi.org/10.1145/3466752.3480090
حقوق: OPEN
رقم الأكسشن: edsair.doi...........ce36f66ff09ccc48ca37144d4455ba15
قاعدة البيانات: OpenAIRE