Convergence of Multi-Scale Reinforcement Q-Learning Algorithms for Mean Field Game and Control Problems

التفاصيل البيبلوغرافية
العنوان: Convergence of Multi-Scale Reinforcement Q-Learning Algorithms for Mean Field Game and Control Problems
المؤلفون: Angiuli, Andrea, Fouque, Jean-Pierre, Laurière, Mathieu, Zhang, Mengrui
سنة النشر: 2023
المجموعة: Mathematics
مصطلحات موضوعية: Mathematics - Optimization and Control
الوصف: We establish the convergence of the unified two-timescale Reinforcement Learning (RL) algorithm presented in a previous work by Angiuli et al. This algorithm provides solutions to Mean Field Game (MFG) or Mean Field Control (MFC) problems depending on the ratio of two learning rates, one for the value function and the other for the mean field term. Our proof of convergence highlights the fact that in the case of MFC several mean field distributions need to be updated and for this reason we present two separate algorithms, one for MFG and one for MFC. We focus on a setting with finite state and action spaces, discrete time and infinite horizon. The proofs of convergence rely on a generalization of the two-timescale approach of Borkar. The accuracy of approximation to the true solutions depends on the smoothing of the policies. We provide a numerical example illustrating the convergence.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2312.06659
رقم الأكسشن: edsarx.2312.06659
قاعدة البيانات: arXiv