Article ID Journal Published Year Pages File Type
1709573 Applied Mathematics Letters 2008 6 Pages PDF
Abstract
This work concerns discrete-time Markov control processes with unbounded costs and unknown disturbance distribution θ. Assuming observability of the random disturbance, we estimate θ using its empirical estimator, which, combined with a variant of the vanishing discount factor approach, yields average cost optimal policies.
Related Topics
Physical Sciences and Engineering Engineering Computational Mechanics
Authors
,