کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1709036 1012839 2009 5 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A note on deterministic approximation of discounted Markov decision processes
موضوعات مرتبط
مهندسی و علوم پایه سایر رشته های مهندسی مکانیک محاسباتی
پیش نمایش صفحه اول مقاله
A note on deterministic approximation of discounted Markov decision processes
چکیده انگلیسی

We study the approximation of a small-noise Markov decision process xt=F(xt−1,at,ξt(ϵ))xt=F(xt−1,at,ξt(ϵ)), t=1,2,…t=1,2,… by means of its deterministic counterpart: x˜t=F(x˜t−1,at,s0), t=1,2,…t=1,2,… where s0s0 is a fixed point of the disturbance metric space (S,r)(S,r). The total discounted cost is used as a criterion of optimality. Supposing that δϵ≔Er(ξ1(ϵ),s0)→0δϵ≔Er(ξ1(ϵ),s0)→0 as ϵ→0ϵ→0, we prove the convergence of optimal policies, estimate the rate of convergence of the optimal costs and give an upper bound (depending on δϵδϵ) for the stability index, which measures the excess of the cost due to a replacement of the optimal policy by its deterministic approximation.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Applied Mathematics Letters - Volume 22, Issue 8, August 2009, Pages 1252–1256
نویسندگان
, , ,