کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1714039 1013261 2008 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Algebraic results and bottom-up algorithm for policies generalization in reinforcement learning using concept lattices
موضوعات مرتبط
مهندسی و علوم پایه سایر رشته های مهندسی کنترل و سیستم های مهندسی
پیش نمایش صفحه اول مقاله
Algebraic results and bottom-up algorithm for policies generalization in reinforcement learning using concept lattices
چکیده انگلیسی

The generalization of policies in reinforcement learning is a main issue, both from the theoretical model point of view and for their applicability. However, generalizing from a set of examples or searching for regularities is a problem which has already been intensively studied in machine learning. Thus, existing domains such as Inductive Logic Programming have already been linked with reinforcement learning. Our work uses techniques in which generalizations are constrained by a language bias, in order to regroup similar states. Such generalizations are principally based on the properties of concept lattices. To guide the possible groupings of similar states of the environment, we propose a general algebraic framework, considering the generalization of policies through a partition of the set of states and using a language bias as an a priori knowledge. We give a practical application as an example of our theoretical approach by proposing and experimenting a bottom-up algorithm.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Nonlinear Analysis: Hybrid Systems - Volume 2, Issue 2, June 2008, Pages 684–694
نویسندگان
, ,