Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
756524 | Systems & Control Letters | 2011 | 7 Pages |
Abstract
In this paper, we give a generalization of a result by Borkar and Meyn (2000) [1], on the stability and convergence of synchronous-update stochastic approximation algorithms, to the case of asynchronous stochastic approximations with delays. We then describe an interesting application of the result to asynchronous distributed temporal difference (TD) learning with function approximation and delays.
Keywords
Related Topics
Physical Sciences and Engineering
Engineering
Control and Systems Engineering
Authors
Shalabh Bhatnagar,