Article ID Journal Published Year Pages File Type
5098821 Journal of Economic Dynamics and Control 2013 14 Pages PDF
Abstract
This paper studies a value function iteration algorithm based on nonexpansive function approximation and Monte Carlo integration that can be applied to almost all stationary dynamic programming problems. The method can be represented using a randomized fitted Bellman operator and a corresponding algorithm that is shown to be globally convergent with probability one. When additional restrictions are imposed, an OP(n−1/2) rate of convergence for Monte Carlo error is obtained.
Related Topics
Physical Sciences and Engineering Mathematics Control and Optimization
Authors
, ,