
Multi-agent differential game based cooperative synchronization control using a data-driven method

Chinese title (translated): Data-driven cooperative consensus control based on multi-agent differential games

Frontiers of Information Technology & Electronic Engineering

Abstract

This paper studies the multi-agent differential game problem and its application to cooperative synchronization control. A systematic formulation and analysis method for the multi-agent differential game is proposed, and a data-driven methodology based on the reinforcement learning (RL) technique is given. First, it is pointed out that, in general, typical distributed controllers do not necessarily lead to a global Nash equilibrium of the differential game because of the coupling of networked interactions. Second, to address this, an alternative local Nash solution is derived by defining the best response concept, and the problem is decomposed into local differential games. An off-policy RL algorithm using neighboring interactive data is constructed to update the controller without requiring a system model, and its stability and robustness properties are proved. Third, to further tackle the dilemma, another differential game configuration is investigated based on modified coupling index functions. In contrast to the previous case, the distributed solution can achieve a global Nash equilibrium while guaranteeing stability. An equivalent parallel RL method is constructed corresponding to this Nash solution. Finally, the effectiveness of the learning process and the stability of synchronization control are illustrated by simulation results.

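Abstract (translated from the Chinese)

This paper studies the multi-agent differential game problem and its application to cooperative synchronization control. A systematic method for formulating and analyzing multi-agent differential games is proposed, together with a data-driven method based on reinforcement learning. First, it is shown that, because of the coupling of network interactions, typical distributed controllers cannot fully guarantee a global Nash equilibrium of the differential game. Second, by defining the best response concept, the problem is decomposed into local differential games and a local Nash equilibrium solution is given. An off-policy reinforcement learning algorithm that requires no system model information is constructed; it uses online neighbor interaction data to optimize and update the controller, and the stability and robustness of the controller are proved. Furthermore, a differential game model based on modified coupling index functions and its equivalent reinforcement learning solution method are proposed. Compared with existing studies, this model resolves the coupling of the information required by the agents and achieves global Nash equilibrium and stable control in a distributed framework. An equivalent parallel reinforcement learning method corresponding to this Nash solution is constructed. Finally, simulation results verify the effectiveness of the learning process and the stability of the synchronization control.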
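The best response and Nash equilibrium notions that the abstract relies on can be illustrated with a toy example. The sketch below is not the paper's method: it uses a static two-player game with scalar actions rather than a differential game, and the cost functions `cost_1` and `cost_2` and all function names are hypothetical, chosen only so that the best responses have a closed form. It alternates best responses until the actions stop changing and then checks that neither player can lower its own cost by deviating unilaterally.

```python
# Toy static game (an assumption for illustration, not the paper's model).
# Player 1 picks u1 to minimize cost_1; player 2 picks u2 to minimize cost_2.

def cost_1(u1, u2):
    # Hypothetical coupled quadratic cost of player 1.
    return u1**2 + u1 * u2 - 2.0 * u1


def cost_2(u1, u2):
    # Hypothetical coupled quadratic cost of player 2.
    return u2**2 + u1 * u2 - 4.0 * u2


def best_response_1(u2):
    # Solves d(cost_1)/d(u1) = 2*u1 + u2 - 2 = 0 for u1.
    return (2.0 - u2) / 2.0


def best_response_2(u1):
    # Solves d(cost_2)/d(u2) = 2*u2 + u1 - 4 = 0 for u2.
    return (4.0 - u1) / 2.0


def best_response_iteration(u1=5.0, u2=-3.0, tol=1e-10, max_iter=1000):
    # Alternate best responses until both actions change by less than tol.
    for _ in range(max_iter):
        new_u1 = best_response_1(u2)
        new_u2 = best_response_2(new_u1)
        if abs(new_u1 - u1) + abs(new_u2 - u2) < tol:
            return new_u1, new_u2
        u1, u2 = new_u1, new_u2
    return u1, u2


def is_nash(u1, u2, deltas=(-0.5, -0.1, 0.1, 0.5)):
    # Nash condition, sampled at a few deviations: no player lowers its own
    # cost by changing only its own action.
    return all(
        cost_1(u1 + d, u2) >= cost_1(u1, u2)
        and cost_2(u1, u2 + d) >= cost_2(u1, u2)
        for d in deltas
    )


if __name__ == "__main__":
    u1, u2 = best_response_iteration()
    print(u1, u2, is_nash(u1, u2))
```

For this particular toy game the two best-response conditions intersect at u1 = 0, u2 = 2, and the iteration converges there because each best-response map is a contraction. In a networked differential game each agent's cost also depends on its neighbors' trajectories, which is the coupling the abstract identifies as the obstacle to a global Nash equilibrium.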


Author information

Corresponding author

Correspondence to Xiwang Dong (董希旺).

Additional information

Project supported by the Science and Technology Innovation 2030, China (No. 2020AAA0108200), the National Natural Science Foundation of China (Nos. 61873011, 61973013, 61922008, and 61803014), the Defense Industrial Technology Development Program, China (No. JCKY2019601C106), the Innovation Zone Project, China (No. 18-163-00-TS-001-001-34), the Foundation Strengthening Program Technology Field Fund, China (No. 2019-JCJQ-JJ-243), and the Fund from the Key Laboratory of Dependable Service Computing in Cyber Physical Society, China (No. CPSDSC202001).

Contributors

Yu SHI designed the research, conducted the simulations, and drafted the paper. Yongzhao HUA and Jianglong YU helped organize the paper. Xiwang DONG and Zhang REN revised and finalized the paper.

Compliance with ethics guidelines

Yu SHI, Yongzhao HUA, Jianglong YU, Xiwang DONG, and Zhang REN declare that they have no conflict of interest.

About this article

Cite this article

Shi, Y., Hua, Y., Yu, J. et al. Multi-agent differential game based cooperative synchronization control using a data-driven method. Front Inform Technol Electron Eng 23, 1043–1056 (2022). https://doi.org/10.1631/FITEE.2200001

  • DOI: https://doi.org/10.1631/FITEE.2200001
