Guiding exploration by pre-existing knowledge without modifying reward

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
404783	677451	2007	12 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

exploration - جهانگردی یا اکتشاف short-term memory - حافظه کوتاه مدت Reinforcement learning - یادگیری تقویتی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش صفحه اول مقاله

Guiding exploration by pre-existing knowledge without modifying reward

چکیده انگلیسی

Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many applications receiving even the first reward may require long exploration, during which the agent has no information about its progress.This paper presents an approach that makes it possible to use pre-existing knowledge about the task for guiding exploration through the state space. Concepts of short- and long-term memory combine guidance by pre-existing knowledge with reinforcement learning methods for value function estimation in order to make learning faster while allowing the agent to converge towards a good policy.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neural Networks - Volume 20, Issue 6, August 2007, Pages 736–747

نویسندگان

Kary Främling,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Guiding exploration by pre-existing knowledge without modifying reward

دسترسی سریع

ارتباط

English Website