Explorative Imitation Learning: A Path Signature Approach for Continuous Environments

التفاصيل البيبلوغرافية
العنوان: Explorative Imitation Learning: A Path Signature Approach for Continuous Environments
المؤلفون: Gavenski, Nathan, Monteiro, Juarez, Meneguzzi, Felipe, Luck, Michael, Rodrigues, Odinaldo
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
الوصف: Some imitation learning methods combine behavioural cloning with self-supervision to infer actions from state pairs. However, most rely on a large number of expert trajectories to increase generalisation and human intervention to capture key aspects of the problem, such as domain constraints. In this paper, we propose Continuous Imitation Learning from Observation (CILO), a new method augmenting imitation learning with two important features: (i) exploration, allowing for more diverse state transitions, requiring less expert trajectories and resulting in fewer training iterations; and (ii) path signatures, allowing for automatic encoding of constraints, through the creation of non-parametric representations of agents and expert trajectories. We compared CILO with a baseline and two leading imitation learning methods in five environments. It had the best overall performance of all methods in all environments, outperforming the expert in two of them.
Comment: This paper has been accepted in the 27th European Conference on Artificial Intelligence (ECAI) 2024
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2407.04856
رقم الأكسشن: edsarx.2407.04856
قاعدة البيانات: arXiv