引用本文:周 锐,陈宗基.强化学习在导弹制导中的应用[J].控制理论与应用,2001,18(5):748~750.[点击复制]
ZHOU Rui,CHEN Zong-ji.Application of Reinforcement Learning in Missile Guidance[J].Control Theory and Technology,2001,18(5):748~750.[点击复制]
强化学习在导弹制导中的应用
Application of Reinforcement Learning in Missile Guidance
摘要点击 2072  全文点击 1546  投稿时间:2000-01-10  修订日期:2000-11-22
查看全文  查看/发表评论  下载PDF阅读器
DOI编号  10.7641/j.issn.1000-8152.2001.5.023
  2001,18(5):748-750
中文关键词  神经网络  强化学习  微分对策  导弹制导
英文关键词  neural networks  reinforcement learning  differential games  missile guidance
基金项目  国家自然科学基金(69904002); 国防预研基金; 航天科技创新基金资助项目.
作者单位
周 锐 北京航空航天大学 自动控制系, 北京 100083 
陈宗基 北京航空航天大学 自动控制系, 北京 100083 
中文摘要
      简述了强化学习的基本原理和特点, 讨论了强化学习中评价函数的神经网络近似问题, 重点分析了采用多神经网络近似评价函数的学习问题, 实现了状态空间或任务的自动分解, 提高了评价函数的推广能力. 网络的学习是离线进行, 并作为反馈控制器在线应用. 并以A 学习为例, 将强化学习应用于导弹的制导问题, 仿真结果表明了强化学习在导弹制导或控制问题中的应用前景和有效性.
英文摘要
      Principle and characteristic of reinforcement learning are outlined. The value function approximation of reinforcement learning with neural networks is studied, and the learning algorithm using modular neural networks to approximate the value function is emphatically analyzed, which decomposes the state space automatically and increases the generalizing ability of the neural networks. The neural networks are trained offline, and is used online as a feedback controller. The A learning algorithm is applied in the missile guidance problem, and the simulation results show the good performance and effectiveness of the application of reinforcement learning in those problems of missile guidance and control.