کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
474535 698908 2006 6 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Discounted Markov decision processes with utility constraints
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
پیش نمایش صفحه اول مقاله
Discounted Markov decision processes with utility constraints
چکیده انگلیسی

We consider utility-constrained Markov decision processes. The expected utility of the total discounted reward is maximized subject to multiple expected utility constraints. By introducing a corresponding Lagrange function, a saddle-point theorem of the utility constrained optimization is derived. The existence of a constrained optimal policy is characterized by optimal action sets specified with a parametric utility.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers & Mathematics with Applications - Volume 51, Issue 2, January 2006, Pages 279-284