Learning to Steer Markovian Agents under Model Uncertainty

التفاصيل البيبلوغرافية
العنوان: Learning to Steer Markovian Agents under Model Uncertainty
المؤلفون: Huang, Jiawei, Thoma, Vinzenz, Shen, Zebang, Nax, Heinrich H., He, Niao
سنة النشر: 2024
المجموعة: Computer Science
Statistics
مصطلحات موضوعية: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Multiagent Systems, Statistics - Machine Learning
الوصف: Designing incentives for an adapting population is a ubiquitous problem in a wide array of economic applications and beyond. In this work, we study how to design additional rewards to steer multi-agent systems towards desired policies \emph{without} prior knowledge of the agents' underlying learning dynamics. We introduce a model-based non-episodic Reinforcement Learning (RL) formulation for our steering problem. Importantly, we focus on learning a \emph{history-dependent} steering strategy to handle the inherent model uncertainty about the agents' learning dynamics. We introduce a novel objective function to encode the desiderata of achieving a good steering outcome with reasonable cost. Theoretically, we identify conditions for the existence of steering strategies to guide agents to the desired policies. Complementing our theoretical contributions, we provide empirical algorithms to approximately solve our objective, which effectively tackles the challenge in learning history-dependent strategies. We demonstrate the efficacy of our algorithms through empirical evaluations.
Comment: 33 Pages
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2407.10207
رقم الأكسشن: edsarx.2407.10207
قاعدة البيانات: arXiv