Survey on Large Scale Neural Network Training

التفاصيل البيبلوغرافية
العنوان: Survey on Large Scale Neural Network Training
المؤلفون: Gusak, Julia, Cherniuk, Daria, Shilova, Alena, Katrutsa, Alexander, Bershatsky, Daniel, Zhao, Xunyi, Eyraud-Dubois, Lionel, Shlyazhko, Oleg, Dimitrov, Denis, Oseledets, Ivan, Beaumont, Olivier
سنة النشر: 2022
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
الوصف: Modern Deep Neural Networks (DNNs) require significant memory to store weight, activations, and other intermediate tensors during training. Hence, many models do not fit one GPU device or can be trained using only a small per-GPU batch size. This survey provides a systematic overview of the approaches that enable more efficient DNNs training. We analyze techniques that save memory and make good use of computation and communication resources on architectures with a single or several GPUs. We summarize the main categories of strategies and compare strategies within and across categories. Along with approaches proposed in the literature, we discuss available implementations.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2202.10435
رقم الأكسشن: edsarx.2202.10435
قاعدة البيانات: arXiv