کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
408062 678242 2011 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
The distribution of rewards in sensorimotor maps acquired by cognitive robots through exploration
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
The distribution of rewards in sensorimotor maps acquired by cognitive robots through exploration
چکیده انگلیسی

To exhibit intelligent behavior, cognitive robots must have some knowledge about the consequences of their actions and their value in the context of the goal being realized. We present a neural framework using which explorative sensorimotor experiences of cognitive robots can be efficiently ‘internalized’ using growing sensorimotor maps and planning realized using goal induced quasi-stationary value fields. Further, if there are no predefined reward functions (or the case when they are not good enough in a slightly modified world), the robot will have to try and realize its goal by exploration after which reward/penalty is given at the end. This paper proposes three simple rules for distribution of the received end reward among the contributing neurons in a high dimensional sensorimotor map. Importantly, reward/penalty distribution over hundreds of neurons in the sensorimotor map is computed one shot. This resulting reward distribution can be visualized as an additional value field, representing the new learnt experience and can be combined with other such fields in a context dependent fashion to plan/compose novel emergent behavior. The simplicity and efficiency of the approach is illustrated through the resulting behaviors of the GNOSYS robot in two different scenarios (a) learning ‘when’ to optimize ‘what constraint’ while realizing spatial goals and (b) learning to push a ball intelligently to the corners of a table, while avoiding traps randomly placed by the teacher (this scenario replicates the famous trap tube paradigm from animal reasoning carried out on chimpanzees, capuchins and infants).

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 74, Issue 17, October 2011, Pages 3440–3455
نویسندگان
, , , ,