Fitted value function iteration with probability one contractions

Article ID	Journal	Published Year	Pages	File Type
5098821	Journal of Economic Dynamics and Control	2013	14 Pages	PDF

Abstract

This paper studies a value function iteration algorithm based on nonexpansive function approximation and Monte Carlo integration that can be applied to almost all stationary dynamic programming problems. The method can be represented using a randomized fitted Bellman operator and a corresponding algorithm that is shown to be globally convergent with probability one. When additional restrictions are imposed, an OP(nâ1/2) rate of convergence for Monte Carlo error is obtained.

Keywords

C61 C63 Dynamic programming Value function iteration Monte Carlo