Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions

التفاصيل البيبلوغرافية
العنوان: Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions
المؤلفون: Christodoulou, Petros, Lange, Robert Tjarko, Shafti, Ali, Faisal, A. Aldo
سنة النشر: 2019
المجموعة: Computer Science
Statistics
مصطلحات موضوعية: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
الوصف: From a young age humans learn to use grammatical principles to hierarchically combine words into sentences. Action grammars is the parallel idea, that there is an underlying set of rules (a "grammar") that govern how we hierarchically combine actions to form new, more complex actions. We introduce the Action Grammar Reinforcement Learning (AG-RL) framework which leverages the concept of action grammars to consistently improve the sample efficiency of Reinforcement Learning agents. AG-RL works by using a grammar inference algorithm to infer the "action grammar" of an agent midway through training. The agent's action space is then augmented with macro-actions identified by the grammar. We apply this framework to Double Deep Q-Learning (AG-DDQN) and a discrete action version of Soft Actor-Critic (AG-SAC) and find that it improves performance in 8 out of 8 tested Atari games (median +31%, max +668%) and 19 out of 20 tested Atari games (median +96%, maximum +3,756%) respectively without substantive hyperparameter tuning. We also show that AG-SAC beats the model-free state-of-the-art for sample efficiency in 17 out of the 20 tested Atari games (median +62%, maximum +13,140%), again without substantive hyperparameter tuning.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/1910.02876
رقم الأكسشن: edsarx.1910.02876
قاعدة البيانات: arXiv