Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
489077 | Procedia Computer Science | 2011 | 6 Pages |
Abstract
In this paper we use parallel processing to combine value functions in order to speedup reinforcement learning. We propose an asynchronous method of periodically composing Q table of local learning clusters to form global Q table. In this research, two approaches are implemented. First is discontinuance learning. Second is combination of value function by asynchronous communication. The asynchronous combination method is compared with a synchronous combination method in order of learning times. A cluster of 40 PCs were used in the experiments are presented. The convergence time and learning times are evaluated and discussed.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science (General)