کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
402405 676936 2012 5 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Dynamic randomization and domain knowledge in Monte-Carlo Tree Search for Go knowledge-based systems
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
Dynamic randomization and domain knowledge in Monte-Carlo Tree Search for Go knowledge-based systems
چکیده انگلیسی

This paper is an extension of the article [13] presented at IWCG of TAAI 2010. It proposes two dynamic randomization techniques for Monte-Carlo Tree Search (MCTS) in Go. First, during the in-tree phase of a simulation game, the parameters are randomized in selected ranges before each simulation move. Second, during the play-out phase, the priority orders of the simulation move-generators are hierarchically randomized before each play-out move. Essential domain knowledge used in MCTS for Go is discussed. Both dynamic randomization techniques increase diversity while keeping the sanity of the simulation games. Experimental testing has been completely re-conducted more extensively with the latest version of GoIntellect (GI) on all three Go categories of 19 × 19, 13 × 13, and 9 × 9 boards. The results show that dynamic randomization increases the playing strength of GI significantly with 128K simulations per move, the improvement is about seven percentage points in the winning rate against GnuGo on 19 × 19 Go over the version of GI without dynamic randomization, about three percentage points on 13 × 13 Go, and four percentage points on 9 × 9 Go.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Knowledge-Based Systems - Volume 34, October 2012, Pages 21–25
نویسندگان
,