Article code: 404503
Journal code: 677431
Publication year: 2008
English article: 9 pages (PDF)
Full-text version: free download
English title of the ISI article
Finding intrinsic rewards by embodied evolution and constrained reinforcement learning
Related subjects
Engineering and Basic Sciences, Computer Engineering, Artificial Intelligence
Abstract

Understanding the design principle of reward functions is a substantial challenge both in artificial intelligence and neuroscience. Successful acquisition of a task usually requires not only rewards for goals, but also for intermediate states to promote effective exploration. This paper proposes a method for designing ‘intrinsic’ rewards of autonomous agents by combining constrained policy gradient reinforcement learning and embodied evolution. To validate the method, we use Cyber Rodent robots, in which collision avoidance, recharging from battery packs, and ‘mating’ by software reproduction are three major ‘extrinsic’ rewards. We show in hardware experiments that the robots can find appropriate ‘intrinsic’ rewards for the vision of battery packs and other robots to promote approach behaviors.

Publisher
Database: Elsevier - ScienceDirect
Journal: Neural Networks - Volume 21, Issue 10, December 2008, Pages 1447–1455