Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
1709036 | Applied Mathematics Letters | 2009 | 5 Pages |
Abstract
We study the approximation of a small-noise Markov decision process xt=F(xt−1,at,ξt(ϵ))xt=F(xt−1,at,ξt(ϵ)), t=1,2,…t=1,2,… by means of its deterministic counterpart: x˜t=F(x˜t−1,at,s0), t=1,2,…t=1,2,… where s0s0 is a fixed point of the disturbance metric space (S,r)(S,r). The total discounted cost is used as a criterion of optimality. Supposing that δϵ≔Er(ξ1(ϵ),s0)→0δϵ≔Er(ξ1(ϵ),s0)→0 as ϵ→0ϵ→0, we prove the convergence of optimal policies, estimate the rate of convergence of the optimal costs and give an upper bound (depending on δϵδϵ) for the stability index, which measures the excess of the cost due to a replacement of the optimal policy by its deterministic approximation.
Related Topics
Physical Sciences and Engineering
Engineering
Computational Mechanics
Authors
Hugo Cruz-Suárez, Evgueni Gordienko, Raúl Montes-de-Oca,