Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
5098821 | Journal of Economic Dynamics and Control | 2013 | 14 Pages |
Abstract
This paper studies a value function iteration algorithm based on nonexpansive function approximation and Monte Carlo integration that can be applied to almost all stationary dynamic programming problems. The method can be represented using a randomized fitted Bellman operator and a corresponding algorithm that is shown to be globally convergent with probability one. When additional restrictions are imposed, an OP(nâ1/2) rate of convergence for Monte Carlo error is obtained.
Related Topics
Physical Sciences and Engineering
Mathematics
Control and Optimization
Authors
JenÅ Pál, John Stachurski,