期刊文献+

半Markov控制过程在折扣代价准则下的最优平稳策略 被引量:1

Optimal stationary policies for semi-Markov control processes with discounted-cost criteria
下载PDF
导出
摘要 讨论一类半Markov控制过程(SMCP)的折扣代价性能优化问题.通过引入一个矩阵,该矩阵可作为一个Markov过程的无穷小矩阵,对一个SMCP定义了折扣Poisson方程,并由这个方程定义了α 势.基于α 势,给出了由最优平稳策略所满足的最优性方程.最后给出一个求解最优平稳策略的迭代算法,并提供一个数值例子以表明该算法的应用. The problems of discounted-cost performance optimization are discussed for a class of semi-Markov control processes (SMCP). A matrix is defined, which can be as the infinitesimal generator of a Markov process. The discounted Poisson equation is proposed for an SMCP by using this matrix, from which the α-potential is defined. Based on the α-potential, the optimality equation satisfied by the optimal stationary policy is given. Finally an iteration algorithm to find an optimal stationary policy is proposed, and a numerical example is provided to illustrate the application of the algorithm.
出处 《控制与决策》 EI CSCD 北大核心 2004年第6期691-694,共4页 Control and Decision
基金 国家自然科学基金资助项目(60274012) 安徽省自然科学基金资助项目(01042308).
关键词 半Markovr控制过程 折扣代价准则 折扣Poisson方程 α-势 最优性方程 最优平稳策略 Decision making Iterative methods Optimal systems Poisson equation
  • 相关文献

参考文献8

  • 1Howard R.Semi-Markovian decision processes[J].Inst Intcernat Statist,1963,40:625-652.
  • 2Ross S M.Applied Probability Models with Optimization Applications[M].San Franciso: Holden-Day,1971.
  • 3Beutler F J,Ross K W.Uniformization for semi-Markov decision processes under stationary policies[J].J Appl Prob,1987,24: 644-656.
  • 4Puterman M L.Markov Decision Processes[M].New York: John Wiley,1994.
  • 5Cinlar E.Introduction to Stochastic Processes[M].Englewood Cliffs: Prentice-Hall Inc,1975.
  • 6奚宏生,唐昊,殷保群.连续时间MCP在紧致行动集上的最优策略(英文)[J].自动化学报,2003,29(2):206-211. 被引量:12
  • 7Tang Hao,Xi Hongsheng,Yin Baoqun.Performance optimization of continuous-time Markov control processes based on performance potentials[J].Int J of Systems Science,2003,34(1):63-71.
  • 8Jewell W S.Markov renewal programming: Ⅰ,Ⅱ[J].Operat Res,1963,2:938-971.

二级参考文献2

共引文献11

同被引文献8

  • 1宋京生.转移概率族非一致有界的连续时间马氏决策规划[J].中国科学:A辑,1987,12:1258-1267.
  • 2Ross S M. Applied Probability Models with Optimization Applications [M]. San Franciso: Holden-Day, 1971.
  • 3Yushkevich A A. Controlled Markov Models with Countable State Space and Continuous Time [J].Theory Probability and Applications, 1977,22 (2) : 215 -235.
  • 4Guo X P, Zhu W P. Denumerable-state Continuous Time Markov Decision Processes with Unbounded Transition and Reward Rates Under the Discounted Criterion [J]. J of Applied Probability, 2002, 39 (2) :233-250.
  • 5Puterman M L. Markov Decision Processes: Discrete Stochastic Dynamic Programming [M], New York:John Wiley, 1994.
  • 6Cao X R. Semi-Markov Decision Problems and Performance Sensitivity Analysis [J]. IEEE Trans on Automatical Control, 2003,48 (5): 758-769.
  • 7Dai G P, Yin B Q, Li Y J, et al. Performance Optimization Algorithms Based on Potential for Semi-Markov Control Processes[J]. Int J of Control, 2005,78(11) : 801-812.
  • 8Cinlar E. Introduction to Stochastic Processes[M ].Englewood Cliffs, New Jersey: Prentice-Hall, Ine,1975.

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部