کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4618494 1339408 2011 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
On the adaptive control of a class of partially observed Markov decision processes
موضوعات مرتبط
مهندسی و علوم پایه ریاضیات آنالیز ریاضی
پیش نمایش صفحه اول مقاله
On the adaptive control of a class of partially observed Markov decision processes
چکیده انگلیسی

This paper is concerned with the adaptive control problem, over the infinite horizon, for partially observable Markov decision processes whose transition functions are parameterized by an unknown vector. We treat finite models and impose relatively mild assumptions on the transition function. Provided that a sequence of parameter estimates converging in probability to the true parameter value is available, we show that the certainty equivalence adaptive policy is optimal in the long-run average sense.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Mathematical Analysis and Applications - Volume 380, Issue 1, 1 August 2011, Pages 1-9