Self-teaching adaptive dynamic programming for Gomoku

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
412651	679673	2012	7 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

adaptive dynamic programming - برنامه ریزی پویا تطبیقی Neural network - شبکه عصبی Temporal difference learning - یادگیری تفاوت زمانی Reinforcement learning - یادگیری تقویتی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش صفحه اول مقاله

Self-teaching adaptive dynamic programming for Gomoku

چکیده انگلیسی

In this paper adaptive dynamic programming (ADP) is applied to learn to play Gomoku. The critic network is used to evaluate board situations. The basic idea is to penalize the last move taken by the loser and reward the last move selected by the winner at the end of a game. The results show that the presented program is able to improve its performance by playing against itself and has approached the candidate level of a commercial Gomoku program called 5-star Gomoku. We also examined the influence of two methods for generating games: self-teaching and learning through watching two experts playing against each other and presented the comparison results and reasons.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 78, Issue 1, 15 February 2012, Pages 23–29

نویسندگان

Dongbin Zhao, Zhen Zhang, Yujie Dai,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Self-teaching adaptive dynamic programming for Gomoku

دسترسی سریع

ارتباط

English Website