Learning to play Go using recursive neural networks

Article code	Journal code	Publication year	English article	Full-text version
10326536	678144	2008	9-page PDF	Free download

Keywords

Computer Go; Knowledge transfer; Computer games; Evaluation function; Recursive neural networks; Machine learning

Related topics

Engineering and Basic Sciences, Computer Engineering, Artificial Intelligence

Abstract

Go is an ancient board game that poses unique opportunities and challenges for artificial intelligence. Currently, there are no computer Go programs that can play at the level of a good human player. However, the emergence of large repositories of games is opening the door for new machine learning approaches to address this challenge. Here we develop a machine learning approach to Go, and related board games, focusing primarily on the problem of learning a good evaluation function in a scalable way. Scalability is essential at multiple levels, from the library of local tactical patterns, to the integration of patterns across the board, to the size of the board itself. The system we propose is capable of automatically learning the propensity of local patterns from a library of games. Propensity and other local tactical information are fed into recursive neural networks, derived from a probabilistic Bayesian network architecture. The recursive neural networks in turn integrate local information across the board in all four cardinal directions and produce local outputs that represent local territory ownership probabilities. The aggregation of these probabilities provides an effective strategic evaluation function that is an estimate of the expected area at the end, or at various other stages, of the game. Local area targets for training can be derived from datasets of games played by human players. In this approach, while requiring a learning time proportional to N^4, skills learned on a board of size N^2 can easily be transferred to boards of other sizes. A system trained using only 9×9 amateur game data performs surprisingly well on a test set derived from 19×19 professional game data. Possible directions for further improvements are briefly discussed.
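
As a rough illustration of the aggregation step described in the abstract, the sketch below sums per-intersection ownership probabilities into an expected area. The function names, the tiny example grid, and the simplification that every intersection ends up owned by exactly one player are assumptions made for illustration and are not taken from the paper. The recursive neural networks that would produce the probabilities are not modeled here.

```python
from typing import List

Grid = List[List[float]]  # grid[r][c] = assumed probability that Black owns (r, c)


def expected_area(ownership_probs: Grid) -> float:
    """Expected number of intersections owned by Black.

    By linearity of expectation, this is the sum of the
    per-intersection ownership probabilities.
    """
    return sum(sum(row) for row in ownership_probs)


def expected_margin(ownership_probs: Grid) -> float:
    """Expected Black area minus expected White area.

    Assumption (illustrative only): each intersection is owned by
    exactly one player, so White's probability is 1 - p.
    """
    return sum(2.0 * p - 1.0 for row in ownership_probs for p in row)


if __name__ == "__main__":
    # Hypothetical 2x2 "board"; real inputs would be 9x9 or 19x19.
    probs = [[1.0, 0.5],
             [0.5, 0.0]]
    print(expected_area(probs))
    print(expected_margin(probs))
```

Because both functions only iterate over whatever grid they are given, the same aggregation applies unchanged to boards of any size, which is in the spirit of the size-transfer property the abstract reports.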

Publisher

Database: Elsevier - ScienceDirect
Journal: Neural Networks - Volume 21, Issue 9, November 2008, Pages 1392-1400

Authors

Lin Wu, Pierre Baldi
