Layer-Wise Partitioning and Merging for Efficient and Scalable Deep Learning

Bibliographic Details
Title: Layer-Wise Partitioning and Merging for Efficient and Scalable Deep Learning
Authors: Akintoye, Samson B., Han, Liangxiu, Lloyd, Huw, Zhang, Xin, Dancey, Darren, Chen, Haoming, Zhang, Daoqiang
Publication Year: 2022
Collection: Computer Science
Subject Terms: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Machine Learning
Description: Deep Neural Network (DNN) models are usually trained sequentially from one layer to the next, which causes forward, backward, and update locking problems, leading to long training times. Existing parallel strategies that mitigate these problems deliver suboptimal runtime performance. In this work, we propose a novel layer-wise partitioning and merging framework with forward- and backward-pass parallelism to improve training performance. The novelty of the proposed work consists of 1) a layer-wise partitioning and merging model that minimises communication overhead between devices without incurring the memory cost of existing strategies during training; 2) a forward-pass and backward-pass parallelisation and optimisation that addresses the update locking problem and minimises the total training cost. Experimental evaluation on real use cases shows that the proposed method outperforms state-of-the-art approaches in training speed, achieving almost linear speedup without compromising the accuracy of the non-parallel approach.
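To make the layer-wise partition-and-merge idea concrete, below is a minimal sketch in PyTorch that splits a sequential model's layers into contiguous groups with roughly balanced parameter counts (one group per device) and then merges undersized groups to reduce the number of device boundaries, and hence activation transfers. The greedy heuristic, function names (partition_layers, merge_small_partitions), and thresholds are illustrative assumptions for exposition, not the algorithm published in the paper.

```python
# Sketch only: a greedy layer-wise partitioner with a merging pass,
# assuming an nn.Sequential model. Not the authors' published method.
import torch.nn as nn

def partition_layers(model: nn.Sequential, num_devices: int):
    """Split layers into contiguous groups with roughly equal parameter counts."""
    layers = list(model.children())
    # Cost proxy: parameter count per layer (floor of 1 for parameter-free layers).
    costs = [sum(p.numel() for p in layer.parameters()) or 1 for layer in layers]
    target = sum(costs) / num_devices
    partitions, current, acc = [], [], 0
    for layer, cost in zip(layers, costs):
        current.append(layer)
        acc += cost
        # Close the current partition once it reaches the balance target,
        # leaving room for the remaining devices.
        if acc >= target and len(partitions) < num_devices - 1:
            partitions.append(current)
            current, acc = [], 0
    if current:
        partitions.append(current)
    return partitions

def merge_small_partitions(partitions, min_layers: int = 2):
    """Fold undersized partitions into their predecessor to cut the number
    of device boundaries, reducing inter-device communication."""
    merged = []
    for part in partitions:
        if merged and len(part) < min_layers:
            merged[-1].extend(part)
        else:
            merged.append(part)
    return merged

if __name__ == "__main__":
    model = nn.Sequential(
        nn.Linear(64, 128), nn.ReLU(),
        nn.Linear(128, 128), nn.ReLU(),
        nn.Linear(128, 10),
    )
    parts = merge_small_partitions(partition_layers(model, num_devices=3))
    for i, part in enumerate(parts):
        print(f"device {i}: {[type(l).__name__ for l in part]}")
```

In this sketch each merged partition would be placed on its own device, with only the boundary activations exchanged during the forward and backward passes; the paper's contribution additionally parallelises those passes to relax update locking, which this illustration does not attempt.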
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2207.11019
Accession Number: edsarx.2207.11019
Database: arXiv