کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
404166 677393 2015 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Immediate return preference emerged from a synaptic learning rule for return maximization
ترجمه فارسی عنوان
اولویت بازگشت فوری از یک قانون یادگیری سیناپسی برای به حداکثر رساندن بازگشت ظاهر شد
کلمات کلیدی
انتخاب بین زمانی، تخفیف تاخیر، تقویت یادگیری، پلاستیک سیناپتیس
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی

Animals including human often prefer immediate returns to larger delayed returns. It holds true in the human communications. Standard interpretation of the immediate return preference is that an animal might subjectively discount the value of a delayed reward, and that might choose the larger valued one. The interpretation has been successfully applied to explain behavior of many species including human. However, the description is not necessarily sufficient to apply for interactions of individuals. This study adopts a different approach to seek a possibility that immediate return preference may be reproduced by learning rule to maximize objective outcomes. We show that a synaptic learning rule to achieve the temporal difference (TD) learning for outcome maximization fails the maximization and exhibits immediate return preference if the context is not properly represented as a internal state.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neural Networks - Volume 62, February 2015, Pages 83–90
نویسندگان
, , ,