Optimizing the Optimal Weighted Average: Efficient Distributed Sparse Classification

التفاصيل البيبلوغرافية
العنوان: Optimizing the Optimal Weighted Average: Efficient Distributed Sparse Classification
المؤلفون: Lu, Fred, Curtin, Ryan R., Raff, Edward, Ferraro, Francis, Holt, James
سنة النشر: 2024
المجموعة: Computer Science
Statistics
مصطلحات موضوعية: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing, Statistics - Machine Learning
الوصف: While distributed training is often viewed as a solution to optimizing linear models on increasingly large datasets, inter-machine communication costs of popular distributed approaches can dominate as data dimensionality increases. Recent work on non-interactive algorithms shows that approximate solutions for linear models can be obtained efficiently with only a single round of communication among machines. However, this approximation often degenerates as the number of machines increases. In this paper, building on the recent optimal weighted average method, we introduce a new technique, ACOWA, that allows an extra round of communication to achieve noticeably better approximation quality with minor runtime increases. Results show that for sparse distributed logistic regression, ACOWA obtains solutions that are more faithful to the empirical risk minimizer and attain substantially higher accuracy than other distributed algorithms.
Comment: Under review
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2406.01753
رقم الأكسشن: edsarx.2406.01753
قاعدة البيانات: arXiv