کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
478902 | 1446176 | 2008 | 7 صفحه PDF | دانلود رایگان |
![عکس صفحه اول مقاله: Optimization of a special case of continuous-time Markov decision processes with compact action set Optimization of a special case of continuous-time Markov decision processes with compact action set](/preview/png/478902.png)
Performance optimization is considered for average-cost multichain Markov decision processes (MDPs) with compact action set. Since, for a general compact multichain model, the optimality equation system may have no solution, and also a policy iteration algorithm may yield a suboptimal policy rather than an optimal one, we concentrate only on a special case of multichain models in this paper, where we assume that the classifications of states are fixed identically rather than varying with policies. By using the concept of performance potentials, the existence of solutions to the optimality equation system is established, and then a potential-based policy iteration algorithm is supposed to solve this system. In addition, the optimality convergence, for recurrent classes, of the algorithm has been proved. Finally, a numerical example is provided.
Journal: European Journal of Operational Research - Volume 187, Issue 1, 16 May 2008, Pages 113–119