The Borkar–Meyn theorem for asynchronous stochastic approximations

Article ID	Journal	Published Year	Pages	File Type
756524	Systems & Control Letters	2011	7 Pages	PDF

Abstract

In this paper, we give a generalization of a result by Borkar and Meyn (2000) [1], on the stability and convergence of synchronous-update stochastic approximation algorithms, to the case of asynchronous stochastic approximations with delays. We then describe an interesting application of the result to asynchronous distributed temporal difference (TD) learning with function approximation and delays.

Keywords

Temporal difference learning