Accelerating Low-Rank Factorization-Based Semidefinite Programming Algorithms on GPU

التفاصيل البيبلوغرافية
العنوان: Accelerating Low-Rank Factorization-Based Semidefinite Programming Algorithms on GPU
المؤلفون: Han, Qiushi, Lin, Zhenwei, Liu, Hanwen, Chen, Caihua, Deng, Qi, Ge, Dongdong, Ye, Yinyu
سنة النشر: 2024
المجموعة: Mathematics
مصطلحات موضوعية: Mathematics - Optimization and Control
الوصف: In this paper, we address a long-standing challenge: how to achieve both efficiency and scalability in solving semidefinite programming problems. We propose breakthrough acceleration techniques for a wide range of low-rank factorization-based first-order methods using GPUs, making the computation much more efficient and scalable. To illustrate the idea and effectiveness of our approach, we use the low-rank factorization-based SDP solver, LoRADS, as an example, which involves both the classic Burer-Monterio method and a novel splitting scheme with a starting logarithmic rank. Our numerical results demonstrate that the accelerated GPU version of LoRADS, cuLoRADS, can solve huge-scale semidefinite programming problems with remarkable efficiency. By effectively leveraging GPU computational power, cuLoRADS exhibits outstanding performance. Specifically, it can solve a set of MaxCut problems with $10^7 \times 10^7$ matrix variables in 10 seconds to 1 minute each on an NVIDIA H100 GPU with 80GB memory, whereas previous solvers demonstrated the capability of handling problems of this scale, required at least dozens of hours per problem on CPUs. Additionally, cuLoRADS shows exceptional scalability by solving 1) a MaxCut problem with a $170 \text{ million} \times 170 \text{ million}$ matrix variable and 2) a Matrix Completion problem with a 20 million $\times$ 20 million matrix variable and approximately 200 million constraints, both in a matter of minutes.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2407.15049
رقم الأكسشن: edsarx.2407.15049
قاعدة البيانات: arXiv