Academic Journal

Model-Free Optimal Consensus Control for Multi-agent Systems Based on DHP Algorithm.

Bibliographic Details
Title: Model-Free Optimal Consensus Control for Multi-agent Systems Based on DHP Algorithm.
Authors: Shi, Haoen; Feng, Yanghe; Mu, Chaoxu; Wu, Yunkai
Source: Neural Processing Letters; Feb 2022, Vol. 54 Issue 1, p501-521, 21p
Subject Terms: MULTIAGENT systems, DISTRIBUTED algorithms, HEURISTIC programming, ITERATIVE learning control, ALGORITHMS, REINFORCEMENT learning, DYNAMIC programming, HAMILTON-Jacobi-Bellman equation
Abstract: This paper develops a novel model-free dual heuristic dynamic programming (DHP) algorithm, combined with policy iteration and least-squares techniques, to implement optimal consensus control of discrete-time multi-agent systems. Achieving optimal consensus control requires solving the coupled Hamilton-Jacobi-Bellman (HJB) equations, which is generally difficult, especially when the mathematical models are unknown. To overcome these difficulties, the DHP method is carried out by reinforcement learning, using data collected online rather than the accurate system dynamics. First, the performance index and the corresponding Bellman equation are obtained. Each agent's value function has a quadratic form. Then, a model network is employed to approximate the accurate system dynamics. The Q-function Bellman equation is obtained next. By taking the derivative of the Q-function, the DHP method is applied to construct the update formula. Convergence and stability analyses of the proposed algorithm are presented. Two simulation examples are provided to illustrate the validity of the proposed algorithm. [ABSTRACT FROM AUTHOR]
Copyright of Neural Processing Letters is the property of Springer Nature and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Complementary Index
Description
ISSN: 1370-4621
DOI: 10.1007/s11063-021-10641-4