کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
478902 1446176 2008 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Optimization of a special case of continuous-time Markov decision processes with compact action set
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
پیش نمایش صفحه اول مقاله
Optimization of a special case of continuous-time Markov decision processes with compact action set
چکیده انگلیسی

Performance optimization is considered for average-cost multichain Markov decision processes (MDPs) with compact action set. Since, for a general compact multichain model, the optimality equation system may have no solution, and also a policy iteration algorithm may yield a suboptimal policy rather than an optimal one, we concentrate only on a special case of multichain models in this paper, where we assume that the classifications of states are fixed identically rather than varying with policies. By using the concept of performance potentials, the existence of solutions to the optimality equation system is established, and then a potential-based policy iteration algorithm is supposed to solve this system. In addition, the optimality convergence, for recurrent classes, of the algorithm has been proved. Finally, a numerical example is provided.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: European Journal of Operational Research - Volume 187, Issue 1, 16 May 2008, Pages 113–119
نویسندگان
, , ,