Data-Efficient Policy Selection for Navigation in Partial Maps via Subgoal-Based Abstraction

Bibliographic details
Title: Data-Efficient Policy Selection for Navigation in Partial Maps via Subgoal-Based Abstraction
Authors: Paudel, Abhishek; Stein, Gregory J.
Publication year: 2023
Collection: Computer Science
Subject terms: Computer Science - Robotics
Description: We present a novel approach for fast and reliable policy selection for navigation in partial maps. Leveraging the recent learning-augmented model-based Learning over Subgoals Planning (LSP) abstraction to plan, our robot reuses data collected during navigation to evaluate how well alternative policies could have performed via a procedure we call offline alt-policy replay. Costs from offline alt-policy replay constrain policy selection among the LSP-based policies during deployment, allowing for improvements in convergence speed, cumulative regret, and average navigation cost. With only limited prior knowledge about the nature of unseen environments, we achieve at least 67% and as much as 96% improvements on cumulative regret over the baseline bandit approach in our experiments in simulated maze and office-like environments.
Comment: 8 pages, 5 figures. Accepted at IROS 2023
Document type: Working Paper
DOI: 10.1109/IROS55552.2023.10342047
Access URL: http://arxiv.org/abs/2304.01094
Accession number: edsarx.2304.01094
Database: arXiv
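
Illustrative note: the description says that costs from offline alt-policy replay constrain bandit-style policy selection, but this record does not give the selection rule. The Python sketch below is one plausible reading, not the authors' implementation. It assumes a UCB-style bandit that minimizes cost, and it assumes each replay result acts as a lower bound on the cost of a policy that was not deployed. The function name `select_policy`, its parameters, and the exploration constant `c` are all hypothetical.

```python
import math

def select_policy(mean_cost, pulls, replay_lower_bound, t, c=1.0):
    """Pick the policy index with the smallest optimistic cost estimate.

    mean_cost[k]          : average observed navigation cost of policy k
    pulls[k]              : number of trials in which policy k was deployed
    replay_lower_bound[k] : assumed lower bound on policy k's cost obtained
                            from offline replay (-inf if none is available)
    t                     : number of trials completed so far (t >= 1)
    c                     : hypothetical exploration constant
    """
    best, best_score = None, math.inf
    for k in range(len(mean_cost)):
        if pulls[k] == 0:
            # An untried policy has no confidence bound of its own.
            lcb = -math.inf
        else:
            # Lower confidence bound on cost (optimism for minimization).
            lcb = mean_cost[k] - c * math.sqrt(math.log(t) / pulls[k])
        # Assumption: replay can only tighten the optimistic estimate.
        score = max(lcb, replay_lower_bound[k])
        if score < best_score:
            best, best_score = k, score
    return best
```

Under these assumptions, an untried policy whose replayed cost is already known to be high is skipped rather than explored, which is one way replay data could speed up convergence and reduce cumulative regret.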