Combining latent learning with dynamic programming in the modular anticipatory classifier system

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
9664057	1446255	2005	24 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Dynamic programming - برنامه‌ریزی پویا یا برنامه‌ نویسی پویا learning classifier systems - سیستم طبقه بندی یادگیری artificial intelligence - هوش مصنوعی Latent learning - یادگیری بی نظیر Reinforcement learning - یادگیری تقویتی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)

پیش نمایش صفحه اول مقاله

Combining latent learning with dynamic programming in the modular anticipatory classifier system

چکیده انگلیسی

Learning Classifier Systems (LCS) are rule based Reinforcement Learning (RL) systems which use a generalization capability. In this paper, we highlight the differences between two kinds of LCSs. Some are used to directly perform RL while others latently learn a model of the interactions between the agent and its environment. Such a model can be used to speed up the core RL process. Thus, these two kinds of learning processes are complementary. We show here how the notion of generalization differs depending on whether the system anticipates (like Anticipatory Classifier System (ACS) and Yet Another Classifier System (YACS)) or not (like XCS). Moreover, we show some limitations of the formalism common to ACS and YACS, and propose a new system, called Modular Anticipatory Classifier System (MACS), which allows the latent learning process to take advantage of new regularities. We describe how the model can be used to perform active exploration and how this exploration may be aggregated with the policy resulting from the reinforcement learning process. The different algorithms are validated experimentally and some limitations in presence of uncertainties are highlighted.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: European Journal of Operational Research - Volume 160, Issue 3, 1 February 2005, Pages 614-637

نویسندگان

Pierre Gérard, Jean-Arcady Meyer, Olivier Sigaud,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Combining latent learning with dynamic programming in the modular anticipatory classifier system

دسترسی سریع

ارتباط

English Website