Function-space Parameterization of Neural Networks for Sequential Learning

التفاصيل البيبلوغرافية
العنوان: Function-space Parameterization of Neural Networks for Sequential Learning
المؤلفون: Scannell, Aidan, Mereu, Riccardo, Chang, Paul, Tamir, Ella, Pajarinen, Joni, Solin, Arno
سنة النشر: 2024
المجموعة: Computer Science
Statistics
مصطلحات موضوعية: Statistics - Machine Learning, Computer Science - Machine Learning
الوصف: Sequential learning paradigms pose challenges for gradient-based deep learning due to difficulties incorporating new data and retaining prior knowledge. While Gaussian processes elegantly tackle these problems, they struggle with scalability and handling rich inputs, such as images. To address these issues, we introduce a technique that converts neural networks from weight space to function space, through a dual parameterization. Our parameterization offers: (i) a way to scale function-space methods to large data sets via sparsification, (ii) retention of prior knowledge when access to past data is limited, and (iii) a mechanism to incorporate new data without retraining. Our experiments demonstrate that we can retain knowledge in continual learning and incorporate new data efficiently. We further show its strengths in uncertainty quantification and guiding exploration in model-based RL. Further information and code is available on the project website.
Comment: 29 pages, 8 figures, Published in The Twelfth International Conference on Learning Representations
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2403.10929
رقم الأكسشن: edsarx.2403.10929
قاعدة البيانات: arXiv