多智能体强化学习在微分博弈中的应用

吕梦欣; 石宗英; 钟宜生

引用本文:	吕梦欣,石宗英,钟宜生.多智能体强化学习在微分博弈中的应用[J].控制理论与应用,2025,42(11):2165~2178.[点击复制]
	LV Meng-xin,SHI Zong-ying,ZHONG Yi-sheng.Applications of multi-agent reinforcement learning in differential games[J].Control Theory & Applications,2025,42(11):2165~2178.[点击复制]

多智能体强化学习在微分博弈中的应用

Applications of multi-agent reinforcement learning in differential games

摘要点击 2296 全文点击 198 投稿时间：2025-03-29 修订日期：2025-10-16

查看全文查看/发表评论下载PDF阅读器 HTML

DOI编号 10.7641/CTA.2025.50128

2025,42(11):2165-2178

中文关键词强化学习微分博弈多智能体系统最优控制

英文关键词 reinforcement learning differential games mult-agent systems optimal control systems

基金项目国家自然科学基金项目(U24B20173,62573016),国家重点研发计划项目(2020YFF0400100)资助.

作者	单位	E-mail
吕梦欣	清华大学自动化系	lvmx23@mails.tsinghua.edu.cn
石宗英	清华大学自动化系	szy@mail.tsinghua.edu.cn
钟宜生^*	清华大学自动化系

中文摘要

微分博弈是一种研究对抗环境下多智能体系统决策问题的方法,在经济、工程等领域都有着广泛的应用前景. 然而当问题建模接近真实场景时,微分博弈的理论求解面临诸多困难.近年来,随着人工智能技术的快速发展, 多智能体深度强化学习方法的研究取得了瞩目突破,为解决微分博弈所面临的挑战提供了新的思路.本文首先对这二者各自的原理、发展现状进行了梳理;之后详细介绍应用于微分博弈的强化学习算法,并根据微分博弈问题类别和网络模型进行细分,总结目前研究对传统方法所面临困难的解决情况;最后,对微分博弈与多智能体强化学习的发展现状进行分析,给出未来可能的研究方向.

英文摘要

Differential games offer a powerful framework for modeling and analyzing the decision-making problems in multi-agent systems under competitive environments, with extensive application prospects in fields like economics and industry. However, as the problem modeling approaches real-world scenarios, obtaining theoretical solutions becomes increasingly challenging. In recent years, with the rapid advancement of artificial intelligence, multi-agent deep rein forcement learning has achieved significant breakthroughs, providing promising alternatives for addressing the challenges encountered in differential games. This paper firstly provides a comprehensive review of the fundamental principles and lat est development of both fields. The paper further details the reinforcement learning methods applied to differential games, categorizing them based on the problem formulations and deep-learning network architectures, and illustrates how recent researches overcome the challenges faced by traditional methods. Finally, the paper summarizes the current progress in this interdisciplinary research area and suggests potential future directions.