
Research on Long-term Stochastic Optimal Operation of Reservoir Based on SARSA Algorithm (cited by: 12)
Abstract: To address the curse of dimensionality in long-term stochastic reservoir operation, this paper describes the inflow as a stochastic process and proposes a long-term stochastic optimal operation model based on reinforcement learning theory. A model-based SARSA algorithm from machine learning is adopted, and the Markov property of the random inflow variable is taken into account. Through greedy decision-making and approximate value iteration, with the learning parameters adjusted, a near-optimal decision sequence is obtained. A case study shows that, compared with the stochastic dynamic programming (SDP) method, the SARSA algorithm obtains a high-quality solution while reducing computation time by approximately 41%. Its solution efficiency and shorter computation time offer a new approach to the long-term stochastic reservoir operation problem.
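The abstract gives no equations or data, so the following is only a minimal sketch of how a tabular SARSA update could be applied to a single reservoir with Markov inflow. Everything specific here is an assumption made for illustration and does not come from the paper: the storage grid, the inflow classes, the transition matrix `P`, the toy benefit function, the epsilon-greedy exploration rule, and the names `train_sarsa`, `step`, `feasible_releases` and `greedy_release`. The known transition matrix stands in for the "model" from which inflow transitions are sampled.

```python
import random

# All numbers below are illustrative assumptions, not data from the paper.
N_PERIODS = 12           # periods in one operating year
N_STORAGE = 5            # discrete storage levels 0..4
INFLOWS = [0, 1, 2]      # inflow volume of each inflow class (storage units)
# P[i][j]: assumed probability that inflow class i is followed by class j
P = [[0.6, 0.3, 0.1],
     [0.3, 0.4, 0.3],
     [0.1, 0.3, 0.6]]
MAX_RELEASE = 2          # largest release per period (storage units)


def feasible_releases(storage, inflow_class):
    """Releases that do not draw more water than is available."""
    available = storage + INFLOWS[inflow_class]
    return list(range(0, min(MAX_RELEASE, available) + 1))


def step(storage, inflow_class, release, rng):
    """Apply the water balance, then sample the next inflow class from P."""
    # Water above the top storage level is spilled.
    next_storage = min(storage + INFLOWS[inflow_class] - release, N_STORAGE - 1)
    # Toy benefit: a fuller reservoir (higher head) yields more energy per unit.
    reward = release * (1.0 + 0.25 * storage)
    next_class = rng.choices(range(len(INFLOWS)), weights=P[inflow_class])[0]
    return next_storage, next_class, reward


def choose(Q, state, actions, epsilon, rng):
    """Epsilon-greedy selection over the feasible releases."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: Q.get((state, a), 0.0))


def train_sarsa(episodes=5000, alpha=0.1, gamma=1.0, epsilon=0.1, seed=0):
    """Learn Q[(period, storage, inflow_class), release] with on-policy SARSA."""
    rng = random.Random(seed)
    Q = {}
    for _ in range(episodes):
        s, k = rng.randrange(N_STORAGE), rng.randrange(len(INFLOWS))
        state = (0, s, k)
        action = choose(Q, state, feasible_releases(s, k), epsilon, rng)
        for t in range(N_PERIODS):
            s2, k2, reward = step(s, k, action, rng)
            if t + 1 < N_PERIODS:
                next_state = (t + 1, s2, k2)
                next_action = choose(Q, next_state, feasible_releases(s2, k2), epsilon, rng)
                # SARSA target uses the action actually chosen for the next state.
                target = reward + gamma * Q.get((next_state, next_action), 0.0)
            else:
                next_state, next_action = None, None
                target = reward  # no value beyond the final period
            old = Q.get((state, action), 0.0)
            Q[(state, action)] = old + alpha * (target - old)
            s, k, state, action = s2, k2, next_state, next_action
    return Q


def greedy_release(Q, t, storage, inflow_class):
    """Read the learned decision for one state (no exploration)."""
    state = (t, storage, inflow_class)
    actions = feasible_releases(storage, inflow_class)
    return max(actions, key=lambda a: Q.get((state, a), 0.0))
```

Calling `Q = train_sarsa()` and then `greedy_release(Q, 0, 2, 1)` returns the learned release for the first period at storage level 2 and inflow class 1. The sketch updates only the state-action pairs visited along sampled trajectories, whereas SDP sweeps every state, decision and inflow combination in every period; this difference is one plausible source of the kind of time saving the abstract reports, although the sketch does not reproduce the paper's experiment.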
Authors: LI Wen-wu 1,2, ZHANG Xue-ying 1,2, Daniel Eliote Mbanze 1,2, WU Wei 1,2 (1. Hubei Key Laboratory of Cascaded Hydropower Stations Operation & Control; 2. College of Electrical Engineering & New Energy, China Three Gorges University, Yichang 443002, China)
Source: Water Resources and Power (《水电能源科学》), Peking University Core Journal, 2018, No. 9, pp. 72-75 (4 pages)
Funding: Hubei Province Technology Innovation Special Project (Key Project) (2017AAA132)
Keywords: reservoir operation; stochastic dynamic programming (SDP); reinforcement learning; value iteration; SARSA
