Mean Nystr\'om Embeddings for Adaptive Compressive Learning

التفاصيل البيبلوغرافية
العنوان: Mean Nystr\'om Embeddings for Adaptive Compressive Learning
المؤلفون: Chatalic, Antoine, Carratino, Luigi, De Vito, Ernesto, Rosasco, Lorenzo
سنة النشر: 2021
المجموعة: Computer Science
Statistics
مصطلحات موضوعية: Statistics - Machine Learning, Computer Science - Machine Learning
الوصف: Compressive learning is an approach to efficient large scale learning based on sketching an entire dataset to a single mean embedding (the sketch), i.e. a vector of generalized moments. The learning task is then approximately solved as an inverse problem using an adapted parametric model. Previous works in this context have focused on sketches obtained by averaging random features, that while universal can be poorly adapted to the problem at hand. In this paper, we propose and study the idea of performing sketching based on data-dependent Nystr\"om approximation. From a theoretical perspective we prove that the excess risk can be controlled under a geometric assumption relating the parametric model used to learn from the sketch and the covariance operator associated to the task at hand. Empirically, we show for k-means clustering and Gaussian modeling that for a fixed sketch size, Nystr\"om sketches indeed outperform those built with random features.
Comment: Accepted to AISTATS 2022. 21 pages, 4 figures
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2110.10996
رقم الأكسشن: edsarx.2110.10996
قاعدة البيانات: arXiv