MPI-rical: Data-Driven MPI Distributed Parallelism Assistance with Transformers

التفاصيل البيبلوغرافية
العنوان: MPI-rical: Data-Driven MPI Distributed Parallelism Assistance with Transformers
المؤلفون: Schneider, Nadav, Kadosh, Tal, Hasabnis, Niranjan, Mattson, Timothy, Pinter, Yuval, Oren, Gal
سنة النشر: 2023
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Computation and Language, Computer Science - Machine Learning
الوصف: Message Passing Interface (MPI) plays a crucial role in distributed memory parallelization across multiple nodes. However, parallelizing MPI code manually, and specifically, performing domain decomposition, is a challenging, error-prone task. In this paper, we address this problem by developing MPI-RICAL, a novel data-driven, programming-assistance tool that assists programmers in writing domain decomposition based distributed memory parallelization code. Specifically, we train a supervised language model to suggest MPI functions and their proper locations in the code on the fly. We also introduce MPICodeCorpus, the first publicly available corpus of MPI-based parallel programs that is created by mining more than 15,000 open-source repositories on GitHub. Experimental results have been done on MPICodeCorpus and more importantly, on a compiled benchmark of MPI-based parallel programs for numerical computations that represent real-world scientific applications. MPI-RICAL achieves F1 scores between 0.87-0.91 on these programs, demonstrating its accuracy in suggesting correct MPI functions at appropriate code locations.. The source code used in this work, as well as other relevant sources, are available at: https://github.com/Scientific-Computing-Lab-NRCN/MPI-rical
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2305.09438
رقم الأكسشن: edsarx.2305.09438
قاعدة البيانات: arXiv