| Article ID | Journal | Published Year | Pages | File Type | 
|---|---|---|---|---|
| 1709573 | Applied Mathematics Letters | 2008 | 6 Pages | 
Abstract
												This work concerns discrete-time Markov control processes with unbounded costs and unknown disturbance distribution θ. Assuming observability of the random disturbance, we estimate θ using its empirical estimator, which, combined with a variant of the vanishing discount factor approach, yields average cost optimal policies.
											Keywords
												
											Related Topics
												
													Physical Sciences and Engineering
													Engineering
													Computational Mechanics
												
											Authors
												J. Adolfo Minjárez-Sosa, 
											