A note on deterministic approximation of discounted Markov decision processes

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
1709036	1012839	2009	5 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Deterministic approximation Markov decision process - روند تصمیم گیری مارکوف Rate of convergence - نرخ همگرایی

موضوعات مرتبط

مهندسی و علوم پایه سایر رشته های مهندسی مکانیک محاسباتی

پیش نمایش صفحه اول مقاله

A note on deterministic approximation of discounted Markov decision processes

چکیده انگلیسی

We study the approximation of a small-noise Markov decision process xt=F(xt−1,at,ξt(ϵ))xt=F(xt−1,at,ξt(ϵ)), t=1,2,…t=1,2,… by means of its deterministic counterpart: x˜t=F(x˜t−1,at,s0), t=1,2,…t=1,2,… where s0s0 is a fixed point of the disturbance metric space (S,r)(S,r). The total discounted cost is used as a criterion of optimality. Supposing that δϵ≔Er(ξ1(ϵ),s0)→0δϵ≔Er(ξ1(ϵ),s0)→0 as ϵ→0ϵ→0, we prove the convergence of optimal policies, estimate the rate of convergence of the optimal costs and give an upper bound (depending on δϵδϵ) for the stability index, which measures the excess of the cost due to a replacement of the optimal policy by its deterministic approximation.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Applied Mathematics Letters - Volume 22, Issue 8, August 2009, Pages 1252–1256

نویسندگان

Hugo Cruz-Suárez, Evgueni Gordienko, Raúl Montes-de-Oca,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

A note on deterministic approximation of discounted Markov decision processes

دسترسی سریع

ارتباط

English Website