期刊文献+

强化学习在足球机器人基本动作学习中的应用 被引量:6

Application of Reinforcement Learning to Basic Action Learning of Soccer Robot
下载PDF
导出
摘要 主要研究了强化学习算法及其在机器人足球比赛技术动作学习问题中的应用.强化学习的状态空间和动作空间过大或变量连续,往往导致学习的速度过慢甚至难于收敛.针对这一问题,提出了基于T-S模型模糊神经网络的强化学习方法,能够有效地实现强化学习状态空间到动作空间的映射.此外,使用提出的强化学习方法设计了足球机器人的技术动作,研究了在不需要专家知识和环境模型情况下机器人的行为学习问题.最后,通过实验证明了所研究方法的有效性,其能够满足机器人足球比赛的需要. This paper discusses reinforcement learning (RL) algorithm and its application to technical action learning of soccer robot. In RL, since the state space and action space are too large or their variables are continuous, the learning speed are too slow and it is usually too hard for learning to converge. To solve this problem, an RL method based on T-S model fuzzy neural network is proposed, which can effectively perform the mapping from the state space to the action space of RL. Furthermore, the proposed method is used to design technical actions of soccer robot, and behavior learning of the robot without expert knowledge and environment model is discussed. Finally, experiments are made and the results show that the presented method is effective and it can meet the demands of robot soccer match.
出处 《机器人》 EI CSCD 北大核心 2008年第5期453-459,共7页 Robot
基金 国家自然科学基金(60475036)
关键词 强化学习 机器人足球比赛 行为学习 T-S模糊神经网络 reinforcement learning (RL) robot soccer match behavior learning T-S fuzzy neural network
  • 相关文献

参考文献11

  • 1Camacho D, Fernandez F, Rodelgo M A. Roboskeleton: An architecture for coordinating robot soccer agents[J]. Engineering Applications of Artificial Intelligence, 2006, 19(2): 179-188.
  • 2Sutton R S, Barto A G. Reinforcement Learning: An Introduction[M]. Cambridge, MA, USA: MIT Press, 1998.
  • 3Bartlett P L. An introduction to reinforcement learning theory: Value function methods[J]. Advanced Lectures on Machine Learning, 2003, 2600: 184-202.
  • 4Jouffe L. Fuzzy inference system learning by reinforcement methods[J]. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 1998, 28(3): 338-355.
  • 5Watkins C J C H, Dayan E Technical note: Q-learning[J]. Machine Learning, 1992, 8(3-4): 279-292.
  • 6赵顺珍.基于神经网络的永磁同步电动机模糊控制[J].沈阳工业大学学报,2006,28(1):62-64. 被引量:6
  • 7梁中华,林志明,刘鑫,许娟.基于模糊控制的PWM整流器的抗负载扰动性能[J].沈阳工业大学学报,2007,29(6):711-715. 被引量:2
  • 8Baird L C. Residual algorithms: Reinforcement learning with function approximation[A]. Proceedings of the 12th International Conference on Machine Learning[C]. San Francisco, CA, USA: Morgan Kaufmann Publishers, 1995.30-37.
  • 9Jung M J, Kim H S, Shim H S, et al. Fuzzy rule extraction for shooting action controller of soccer robot[A]. Proceedings of the IEEE International Fuzzy Systems Conference[C]. Piscataway, NJ, USA: IEEE, 1999. 556-561.
  • 10Stone P, Sutton R S, Kuhlmann. G. Reinforcement learning for robocup soccer keepaway[J]. Adaptive Behavior, 2005, 13(3): 165-188.

二级参考文献18

共引文献13

同被引文献91

引证文献6

二级引证文献34

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部