Reinforcement Learning Method Applied to Multiobjective Emergency Control of Transient Voltage Security (cited by: 2)
Abstract Transient voltage collapse seriously threatens power grid security, so emergency control is urgently needed. Taking the adjustments to generator terminal voltage reference values and the reactive power switched by capacitors/reactors as control variables, this paper uses trajectory sensitivity to build a multiobjective emergency control model for transient voltage security. The model minimizes, in two stages, the voltage deviation at key load nodes, the control cost, and the variance of the generators' reactive power output ratios. The model is solved with a simplified reinforcement learning method that redefines the state function over the solution space and adjusts the action magnitude, and state sensitivity is introduced to resolve the conflict between exploration and exploitation. The feasible region is divided into small zones, and the likelihood that each zone contains an optimal solution is assessed separately, which narrows the search range. The quality of the Pareto frontier is further improved by optimizing the search strategy, and the objective-function weights are set according to actual operating conditions to determine a compromise solution. Time-domain simulation on a provincial power grid shows that the proposed method can restore the transient voltage to a secure state and outperforms the normal boundary intersection method in both solution efficiency and Pareto frontier quality.
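The last step described in the abstract, choosing a compromise solution from the Pareto frontier by weighting the objectives, can be illustrated with a minimal Python sketch. This is not the paper's reinforcement learning solver. The candidate values, the min-max normalization, and the function names `pareto_front` and `compromise` are assumptions made only for illustration; the three objectives are taken to be voltage deviation, control cost, and the variance of the reactive power output ratio, all minimized.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(a: Point, b: Point) -> bool:
    """True if a is no worse than b in every objective and strictly better in one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(points: Sequence[Point]) -> List[Point]:
    """Keep only the non-dominated points (all objectives are minimized)."""
    return [p for p in points if not any(dominates(q, p) for q in points if q != p)]


def compromise(front: Sequence[Point], weights: Sequence[float]) -> Point:
    """Pick the front point with the smallest weighted sum of normalized objectives.

    Min-max normalization over the front is an assumption of this sketch,
    used so that objectives with different units can be compared.
    """
    n = len(weights)
    lows = [min(p[i] for p in front) for i in range(n)]
    highs = [max(p[i] for p in front) for i in range(n)]

    def score(p: Point) -> float:
        total = 0.0
        for i, w in enumerate(weights):
            span = highs[i] - lows[i]
            total += w * ((p[i] - lows[i]) / span if span > 0 else 0.0)
        return total

    return min(front, key=score)


# Hypothetical candidates: (voltage deviation, control cost, variance of Q ratio)
candidates = [
    (0.10, 5.0, 0.30),
    (0.20, 3.0, 0.20),
    (0.30, 1.0, 0.40),
    (0.25, 4.0, 0.35),  # dominated by the second candidate
    (0.30, 2.0, 0.45),  # dominated by the third candidate
]

front = pareto_front(candidates)
security_first = compromise(front, (0.6, 0.2, 0.2))  # voltage deviation weighted most
cost_first = compromise(front, (0.1, 0.8, 0.1))      # control cost weighted most
```

Changing the weights moves the selected point along the frontier, which is the sense in which the weights can reflect the operating condition at hand.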
Source Journal of South China University of Technology (Natural Science Edition) (indexed in EI, CAS, CSCD, PKU Core), 2015, No. 12, pp. 9-17 (9 pages)
Funding Supported by the National Natural Science Foundation of China (51277078)
Keywords transient voltage security; emergency control; trajectory sensitivity; multiobjective optimization; reinforcement learning