کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
380757 1437454 2013 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A novel modular Q-learning architecture to improve performance under incomplete learning in a grid soccer game
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
A novel modular Q-learning architecture to improve performance under incomplete learning in a grid soccer game
چکیده انگلیسی


• We propose new modular structure for multi-agent reinforcement learning.
• We increase the learning rate by introducing new partial and single modules.
• Q-learning performance is improved for states with insufficient experience.
• New methods are proposed for more reliable decision making.

Multi-agent reinforcement learning methods suffer from several deficiencies that are rooted in the large state space of multi-agent environments. This paper tackles two deficiencies of multi-agent reinforcement learning methods: their slow learning rate, and low quality decision-making in early stages of learning. The proposed methods are applied in a grid-world soccer game. In the proposed approach, modular reinforcement learning is applied to reduce the state space of the learning agents from exponential to linear in terms of the number of agents. The modular model proposed here includes two new modules, a partial-module and a single-module. These two new modules are effective for increasing the speed of learning in a soccer game. We also apply the instance-based learning concepts, to choose proper actions in states that are not experienced adequately during learning. The key idea is to use neighbouring states that have been explored sufficiently during the learning phase. The results of experiments in a grid-soccer game environment show that our proposed methods produce a higher average reward compared to the situation where the proposed method is not applied to the modular structure.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Engineering Applications of Artificial Intelligence - Volume 26, Issue 9, October 2013, Pages 2164–2171
نویسندگان
, , , ,