| 引用本文: | 吕梦欣,石宗英,钟宜生.多智能体强化学习在微分博弈中的应用[J].控制理论与应用,2025,42(11):2165~2178.[点击复制] |
| LV Meng-xin,SHI Zong-ying,ZHONG Yi-sheng.Applications of multi-agent reinforcement learning in differential games[J].Control Theory & Applications,2025,42(11):2165~2178.[点击复制] |
|
| 多智能体强化学习在微分博弈中的应用 |
| Applications of multi-agent reinforcement learning in differential games |
| 摘要点击 2296 全文点击 198 投稿时间:2025-03-29 修订日期:2025-10-16 |
| 查看全文 查看/发表评论 下载PDF阅读器 HTML |
| DOI编号 10.7641/CTA.2025.50128 |
| 2025,42(11):2165-2178 |
| 中文关键词 强化学习 微分博弈 多智能体系统 最优控制 |
| 英文关键词 reinforcement learning differential games mult-agent systems optimal control systems |
| 基金项目 国家自然科学基金项目(U24B20173,62573016),国家重点研发计划项目(2020YFF0400100)资助. |
|
| 中文摘要 |
| 微分博弈是一种研究对抗环境下多智能体系统决策问题的方法,在经济、工程等领域都有着广泛的应用前
景. 然而当问题建模接近真实场景时,微分博弈的理论求解面临诸多困难.近年来,随着人工智能技术的快速发展,
多智能体深度强化学习方法的研究取得了瞩目突破,为解决微分博弈所面临的挑战提供了新的思路.本文首先对
这二者各自的原理、发展现状进行了梳理;之后详细介绍应用于微分博弈的强化学习算法,并根据微分博弈问题类
别和网络模型进行细分,总结目前研究对传统方法所面临困难的解决情况;最后,对微分博弈与多智能体强化学习
的发展现状进行分析,给出未来可能的研究方向. |
| 英文摘要 |
| Differential games offer a powerful framework for modeling and analyzing the decision-making problems
in multi-agent systems under competitive environments, with extensive application prospects in fields like economics and
industry. However, as the problem modeling approaches real-world scenarios, obtaining theoretical solutions becomes
increasingly challenging. In recent years, with the rapid advancement of artificial intelligence, multi-agent deep rein
forcement learning has achieved significant breakthroughs, providing promising alternatives for addressing the challenges
encountered in differential games. This paper firstly provides a comprehensive review of the fundamental principles and lat
est development of both fields. The paper further details the reinforcement learning methods applied to differential games,
categorizing them based on the problem formulations and deep-learning network architectures, and illustrates how recent
researches overcome the challenges faced by traditional methods. Finally, the paper summarizes the current progress in this
interdisciplinary research area and suggests potential future directions. |
|
|
|
|
|