دورية أكاديمية

DM-DQN: Dueling Munchausen deep Q network for robot path planning

التفاصيل البيبلوغرافية
العنوان: DM-DQN: Dueling Munchausen deep Q network for robot path planning
المؤلفون: Yuwan Gu, Zhitao Zhu, Jidong Lv, Lin Shi, Zhenjie Hou, Shoukun Xu
المصدر: Complex & Intelligent Systems, Vol 9, Iss 4, Pp 4287-4300 (2022)
بيانات النشر: Springer, 2022.
سنة النشر: 2022
المجموعة: LCC:Electronic computers. Computer science
LCC:Information technology
مصطلحات موضوعية: Deep reinforcement learning, DM-DQN, Path planning, Dueling network, Electronic computers. Computer science, QA75.5-76.95, Information technology, T58.5-58.64
الوصف: Abstract In order to achieve collision-free path planning in complex environment, Munchausen deep Q-learning network (M-DQN) is applied to mobile robot to learn the best decision. On the basis of Soft-DQN, M-DQN adds the scaled log-policy to the immediate reward. The method allows agent to do more exploration. However, the M-DQN algorithm has the problem of slow convergence. A new and improved M-DQN algorithm (DM-DQN) is proposed in the paper to address the problem. First, its network structure was improved on the basis of M-DQN by decomposing the network structure into a value function and an advantage function, thus decoupling action selection and action evaluation and speeding up its convergence, giving it better generalization performance and enabling it to learn the best decision faster. Second, to address the problem of the robot’s trajectory being too close to the edge of the obstacle, a method of using an artificial potential field to set a reward function is proposed to drive the robot’s trajectory away from the vicinity of the obstacle. The result of simulation experiment shows that the method learns more efficiently and converges faster than DQN, Dueling DQN and M-DQN in both static and dynamic environments, and is able to plan collision-free paths away from obstacles.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2199-4536
2198-6053
Relation: https://doaj.org/toc/2199-4536; https://doaj.org/toc/2198-6053
DOI: 10.1007/s40747-022-00948-7
URL الوصول: https://doaj.org/article/32168163282b494a9a606fb8cb9fd303
رقم الأكسشن: edsdoj.32168163282b494a9a606fb8cb9fd303
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:21994536
21986053
DOI:10.1007/s40747-022-00948-7