A Hierarchical Reinforcement Learning Algorithm Based on Fuzzy Clustering
Abstract: This paper presents a new algorithm for automatically generating Options in hierarchical reinforcement learning. The algorithm takes as input the part of the state space that the agent explores during the initial learning phase and clusters these states with a network of fuzzy logic neurons. On each of the resulting state subsets, a set of intra-option policies is learned through experience replay, and the Options are generated from them. Simulation experiments demonstrate the effectiveness of the algorithm.
Authors: Zhang Xin (张欣), Dai Shuai (戴帅)
Source: Computer Engineering & Science (《计算机工程与科学》), CSCD, PKU Core; 2010, No. 1, pp. 55-56, 91 (3 pages)
Funding: Project supported by the Hunan Provincial Education Commission (07C083)
Keywords: reinforcement learning; hierarchical reinforcement learning; fuzzy clustering; Option
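The pipeline summarized in the abstract (cluster the explored states, then learn an intra-option policy on each state subset by experience replay) can be illustrated with a rough sketch. Everything below is an assumption made for illustration and is not taken from the paper. Standard fuzzy c-means stands in for the fuzzy logic neuron network. The two-room grid is a hypothetical task. Each option is assumed to end, with a pseudo-reward of 1, when the agent leaves its state subset. All names (`fuzzy_c_means`, `explore`, `learn_option`, `STATES`, `OPTIONS`) are made up for this sketch.

```python
import random

ACTIONS = [(1, 0), (-1, 0), (0, 1), (0, -1)]  # right, left, up, down

def memberships(points, centers, m=2.0):
    """Fuzzy membership of every point in every cluster (each row sums to 1)."""
    rows = []
    for p in points:
        d = [max(sum((a - b) ** 2 for a, b in zip(p, c)) ** 0.5, 1e-9) for c in centers]
        rows.append([1.0 / sum((di / dk) ** (2.0 / (m - 1.0)) for dk in d) for di in d])
    return rows

def fuzzy_c_means(points, centers, m=2.0, iters=30):
    """Plain fuzzy c-means, used here as a stand-in for the paper's clustering step."""
    for _ in range(iters):
        u = memberships(points, centers, m)
        centers = [
            [sum(row[i] ** m * p[k] for row, p in zip(u, points)) /
             sum(row[i] ** m for row in u) for k in range(len(points[0]))]
            for i in range(len(centers))
        ]
    return centers, memberships(points, centers, m)

def explore(states, steps, rng):
    """Random walk that records (s, a, s') transitions for later replay."""
    s, buffer = states[0], []
    for _ in range(steps):
        a = rng.randrange(len(ACTIONS))
        nxt = (s[0] + ACTIONS[a][0], s[1] + ACTIONS[a][1])
        nxt = nxt if nxt in states else s  # moving into a wall leaves the agent in place
        buffer.append((s, a, nxt))
        s = nxt
    return buffer

def learn_option(cluster, label, buffer, sweeps=20, alpha=0.5, gamma=0.9):
    """Q-learning over replayed transitions that start inside one cluster."""
    n_act = len(ACTIONS)
    q = {(s, a): 0.0 for s in label if label[s] == cluster for a in range(n_act)}
    replay = [(s, a, n) for s, a, n in buffer if label[s] == cluster]
    for _ in range(sweeps):
        for s, a, n in replay:
            if label[n] != cluster:
                target = 1.0  # assumed pseudo-reward: leaving the subset ends the option
            else:
                target = gamma * max(q[(n, b)] for b in range(n_act))
            q[(s, a)] += alpha * (target - q[(s, a)])
    states = {s for s, _ in q}
    # Greedy intra-option policy: best action index for each state in the subset.
    return {s: max(range(n_act), key=lambda b: q[(s, b)]) for s in states}

# Hypothetical toy task: two 3x3 rooms joined by a doorway cell at (3, 1).
STATES = sorted([(x, y) for x in (0, 1, 2, 4, 5, 6) for y in range(3)] + [(3, 1)])
CENTERS, U = fuzzy_c_means(STATES, [list(STATES[0]), list(STATES[-1])])
LABEL = {s: max(range(2), key=lambda i: U[j][i]) for j, s in enumerate(STATES)}
BUFFER = explore(STATES, 5000, random.Random(0))
OPTIONS = {c: learn_option(c, LABEL, BUFFER) for c in range(2)}
```

In this sketch each entry of `OPTIONS` maps the states of one cluster to an action index into `ACTIONS`, so an option's initiation set is simply the set of keys of its policy. Assigning each state to its highest-membership cluster is also a simplification; the fuzzy memberships in `U` could instead be kept to let boundary states belong to more than one option.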

