کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
408339 679021 2011 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Robust high performance reinforcement learning through weighted k-nearest neighbors
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
Robust high performance reinforcement learning through weighted k-nearest neighbors
چکیده انگلیسی

The aim of this paper is to present (jointly) a series of robust high performance (award winning) implementations of reinforcement learning algorithms based on temporal-difference learning and weighted kk- nearest neighbors for linear function approximation. These algorithms, named kNN‐TD(λ)kNN‐TD(λ) methods, where rigorously tested at the Second and Third Annual Reinforcement Learning Competitions (RLC2008 and RCL2009) held in Helsinki and Montreal respectively, where the kNN‐TD(λ)kNN‐TD(λ) method (JAMH team) won in the PolyAthlon 2008 domain, obtained the second place in 2009 and also the second place in the Mountain-Car 2008 domain showing that it is one of the state of the art general purpose reinforcement learning implementations. These algorithms are able to learn quickly, to generalize properly over continuous state spaces and also to be robust to a high degree of environmental noise. Furthermore, we describe a derivation of kNN‐TD(λ)kNN‐TD(λ) algorithm for problems where the use of continuous actions have clear advantages over the use of fine grained discrete actions: the Ex〈a〉Ex〈a〉 reinforcement learning algorithm.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 74, Issue 8, 15 March 2011, Pages 1251–1259
نویسندگان
, , ,