Accelerating Machine Learning Inference with GPUs in ProtoDUNE Data Processing

التفاصيل البيبلوغرافية
العنوان: Accelerating Machine Learning Inference with GPUs in ProtoDUNE Data Processing
المؤلفون: Cai, Tejin, Herner, Kenneth, Yang, Tingjun, Wang, Michael, Flechas, Maria Acosta, Harris, Philip, Holzman, Burt, Pedro, Kevin, Tran, Nhan
المصدر: Comput Softw Big Sci 7, 11 (2023)
سنة النشر: 2023
المجموعة: Computer Science
High Energy Physics - Experiment
Physics (Other)
مصطلحات موضوعية: High Energy Physics - Experiment, Computer Science - Distributed, Parallel, and Cluster Computing, Physics - Data Analysis, Statistics and Probability
الوصف: We study the performance of a cloud-based GPU-accelerated inference server to speed up event reconstruction in neutrino data batch jobs. Using detector data from the ProtoDUNE experiment and employing the standard DUNE grid job submission tools, we attempt to reprocess the data by running several thousand concurrent grid jobs, a rate we expect to be typical of current and future neutrino physics experiments. We process most of the dataset with the GPU version of our processing algorithm and the remainder with the CPU version for timing comparisons. We find that a 100-GPU cloud-based server is able to easily meet the processing demand, and that using the GPU version of the event processing algorithm is two times faster than processing these data with the CPU version when comparing to the newest CPUs in our sample. The amount of data transferred to the inference server during the GPU runs can overwhelm even the highest-bandwidth network switches, however, unless care is taken to observe network facility limits or otherwise distribute the jobs to multiple sites. We discuss the lessons learned from this processing campaign and several avenues for future improvements.
Comment: 13 pages, 9 figures, matches accepted version
نوع الوثيقة: Working Paper
DOI: 10.1007/s41781-023-00101-0
URL الوصول: http://arxiv.org/abs/2301.04633
رقم الأكسشن: edsarx.2301.04633
قاعدة البيانات: arXiv
الوصف
DOI:10.1007/s41781-023-00101-0