期刊文献+

Collaborative multi-agent reinforcement learning based on experience propagation 被引量:5

Collaborative multi-agent reinforcement learning based on experience propagation
下载PDF
导出
摘要 For multi-agent reinforcement learning in Markov games, knowledge extraction and sharing are key research problems. State list extracting means to calculate the optimal shared state path from state trajectories with cycles. A state list extracting algorithm checks cyclic state lists of a current state in the state trajectory, condensing the optimal action set of the current state. By reinforcing the optimal action selected, the action policy of cyclic states is optimized gradually. The state list extracting is repeatedly learned and used as the experience knowledge which is shared by teams. Agents speed up the rate of convergence by experience sharing. Competition games of preys and predators are used for the experiments. The results of experiments prove that the proposed algorithms overcome the lack of experience in the initial stage, speed up learning and improve the performance. For multi-agent reinforcement learning in Markov games, knowledge extraction and sharing are key research problems. State list extracting means to calculate the optimal shared state path from state trajectories with cycles. A state list extracting algorithm checks cyclic state lists of a current state in the state trajectory, condensing the optimal action set of the current state. By reinforcing the optimal action selected, the action policy of cyclic states is optimized gradually. The state list extracting is repeatedly learned and used as the experience knowledge which is shared by teams. Agents speed up the rate of convergence by experience sharing. Competition games of preys and predators are used for the experiments. The results of experiments prove that the proposed algorithms overcome the lack of experience in the initial stage, speed up learning and improve the performance.
出处 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2013年第4期683-689,共7页 系统工程与电子技术(英文版)
基金 supported by the National Natural Science Foundation of China (61070143 61173088)
关键词 MULTI-AGENT Q learning state list extracting experience sharing. multi-agent Q learning state list extracting experience sharing.
  • 相关文献

参考文献2

二级参考文献17

  • 1蒋建国,夏娜,于春华.基于能力向量发挥率和拍卖的联盟形成策略[J].电子学报,2004,32(F12):215-217. 被引量:20
  • 2宋梅萍,顾国昌,张国印.随机博弈框架下的多agent强化学习方法综述[J].控制与决策,2005,20(10):1081-1090. 被引量:13
  • 3张国富,蒋建国,夏娜,苏兆品.基于离散粒子群算法求解复杂联盟生成问题[J].电子学报,2007,35(2):323-327. 被引量:33
  • 4Ho F,Kamel M.Learning coordinating strategies for cooperative multiagent systems[J].Machine Learning,1998,33(2-3):155 -177.
  • 5Garland A,Alterman R.Autonomous agents that learn to better coordinate[J].Autonomous Agents and Multi-Agent Systems,2004,8 (3):267-301.
  • 6Kaelbing L P,Littman M L,Moore A W.Reinforcement learning:A survey[J].Journal of Artificial Intelligence Research,1996,4:237-285.
  • 7Brafman R I,Tennenholtz M.Learning to coordinate efficiently:A model-based approach[J].Journal of Artificial Intelligence Research,2003,18:517-529.
  • 8Chen G,Yang Zh.Coordinating multiple agents via rein -forcement learning[J].Autonomous Agents and Multi-Agent Systems,2005,10 (3):273-328.
  • 9Watkins C J C H,Dayan P.Technical note:Q-learning[J].Machine learning,1992,8(3-4):279 -292.
  • 10Grefenstette J J.Credit assignment in rule discovery systems based on genetic algorithms[J].Machine Learning,1988,3(2-3):225 -245.

共引文献12

同被引文献26

引证文献5

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部