کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4614592 1339294 2016 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Near optimality of quantized policies in stochastic control under weak continuity conditions
ترجمه فارسی عنوان
نزدیک به مطلوب بودن سیاست های کوانتومی در کنترل تصادفی در شرایط تداوم ضعیف
موضوعات مرتبط
مهندسی و علوم پایه ریاضیات آنالیز ریاضی
چکیده انگلیسی

This paper studies the approximation of optimal control policies by quantized (discretized) policies for a very general class of Markov decision processes (MDPs). The problem is motivated by applications in networked control systems, computational methods for MDPs, and learning algorithms for MDPs. We consider the finite-action approximation of stationary policies for a discrete-time Markov decision process with discounted and average costs under a weak continuity assumption on the transition probability, which is a significant relaxation of conditions required in earlier literature. The discretization is constructive, and quantized policies are shown to approximate optimal deterministic stationary policies with arbitrary precision. The results are applied to the fully observed reduction of a partially observed Markov decision process, where weak continuity is a much more reasonable assumption than more stringent conditions such as strong continuity or continuity in total variation.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Mathematical Analysis and Applications - Volume 435, Issue 1, 1 March 2016, Pages 321–337
نویسندگان
, , ,