Robust high performance reinforcement learning through weighted k-nearest neighbors

Article code: 408339
Journal code: 679021
Publication year: 2011
English article: 9-page PDF

Keywords

Temporal-difference learning; Function approximation; Reinforcement learning

Related topics

Engineering and Basic Sciences > Computer Engineering > Artificial Intelligence

Abstract

The aim of this paper is to present jointly a series of robust, high-performance (award-winning) implementations of reinforcement learning algorithms based on temporal-difference learning and weighted k-nearest neighbors for linear function approximation. These algorithms, named kNN-TD(λ) methods, were rigorously tested at the Second and Third Annual Reinforcement Learning Competitions (RLC2008 and RLC2009), held in Helsinki and Montreal respectively. There, the kNN-TD(λ) method (JAMH team) won in the PolyAthlon 2008 domain, obtained second place in 2009, and also obtained second place in the Mountain-Car 2008 domain, showing that it is one of the state-of-the-art general-purpose reinforcement learning implementations. These algorithms are able to learn quickly, to generalize properly over continuous state spaces, and to be robust to a high degree of environmental noise. Furthermore, we describe a derivation of the kNN-TD(λ) algorithm for problems where the use of continuous actions has clear advantages over the use of fine-grained discrete actions: the Ex〈a〉 reinforcement learning algorithm.
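
The idea of combining temporal-difference learning with weighted k-nearest neighbors can be illustrated with a minimal sketch. This is not the authors' implementation: the weighting 1/(1 + d²), the Q-learning-style target, the omission of eligibility traces (so it corresponds to λ = 0), and all names (`knn_weights`, `q_values`, `td_update`, `prototypes`, `q_table`) are assumptions made for illustration only.

```python
import numpy as np

def knn_weights(state, prototypes, k):
    """Return indices of the k nearest prototypes and their normalized weights."""
    d2 = np.sum((prototypes - state) ** 2, axis=1)  # squared Euclidean distances
    idx = np.argsort(d2)[:k]                        # indices of the k nearest prototypes
    w = 1.0 / (1.0 + d2[idx])                       # assumed weighting: closer means larger
    return idx, w / w.sum()                         # normalized so the weights sum to 1

def q_values(state, prototypes, q_table, k):
    """Estimate Q(state, a) for every action as a weighted average over neighbors."""
    idx, p = knn_weights(state, prototypes, k)
    return p @ q_table[idx]                         # shape: (n_actions,)

def td_update(state, action, reward, next_state, prototypes, q_table,
              k=2, alpha=0.5, gamma=0.9):
    """One TD(0)-style update whose correction is shared among the k neighbors."""
    idx, p = knn_weights(state, prototypes, k)
    # Assumed target: greedy (Q-learning) bootstrap from the next state.
    target = reward + gamma * q_values(next_state, prototypes, q_table, k).max()
    delta = target - p @ q_table[idx, action]       # TD error for the taken action
    q_table[idx, action] += alpha * delta * p       # each neighbor moves in proportion to its weight
    return delta
```

Because the value estimate is a weighted sum of stored per-prototype values, it is linear in those values, which is the sense in which such a scheme counts as linear function approximation.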

Publisher

Database: Elsevier - ScienceDirect
Journal: Neurocomputing - Volume 74, Issue 8, 15 March 2011, Pages 1251–1259

Authors

José Antonio Martín H, Javier de Lope, Darío Maravall
