Article code: 4496222
Journal code: 1623870
Publication year: 2014
English article: 6-page PDF
Full-text version: Free download
English title of the ISI article
The application of temporal difference learning in optimal diet models
Keywords
Optimal diet; Batesian mimicry; Predator-prey; Taste sampling; Temporal difference learning
Related topics
Life Sciences and Biotechnology > Agricultural and Biological Sciences > Agricultural and Biological Sciences (General)
English abstract

Author-Highlights
• We apply model-free reinforcement learning to optimal diet models.
• The presented model incorporates uncertainty of changing environments.
• The model predicts effects of Batesian mimics and aposematism on predators' diet choice and energy intake.
• The model uses a precondition of exploration of the action space for successful aversion formation.
• Conflicting rewards lead to foraging behaviour which is conditionally suboptimal in fixed environments but allows better adaptation in changing environments.

An experience-based aversive learning model of foraging behaviour in uncertain environments is presented. We use Q-learning as a model-free implementation of temporal difference learning, motivated by growing evidence for neural correlates in natural reinforcement settings. The predator can either include an aposematic prey in its diet or forage on alternative food sources. We show how the predator's foraging behaviour and energy intake depend on the toxicity of the defended prey and on the presence of Batesian mimics. We introduce exploration of the action space as a precondition for successful aversion formation and show how, in the presence of conflicting rewards, it predicts foraging behaviour that is conditionally suboptimal in a fixed environment but allows better adaptation in changing environments.
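
The Q-learning setup described in the abstract can be illustrated with a minimal sketch. This is not the authors' model: the single-state formulation, the payoff values, the parameter names (`toxicity`, `mimic_fraction`, `alpha`, `gamma`, `epsilon`) and the epsilon-greedy exploration rule are all assumptions chosen for illustration only.

```python
import random

ACTIONS = ("attack", "alternative")


def sample_reward(action, toxicity, mimic_fraction, rng):
    """Return an energy payoff. All values are illustrative assumptions."""
    if action == "alternative":
        return 0.5  # assumed fixed gain from alternative food
    # The aposematic-looking item is either a palatable Batesian mimic
    # or the defended (toxic) model.
    if rng.random() < mimic_fraction:
        return 1.0  # assumed gain from a palatable mimic
    return 1.0 - toxicity  # assumed gain minus a toxicity cost


def run_q_learning(toxicity, mimic_fraction, steps=5000,
                   alpha=0.1, gamma=0.5, epsilon=0.1, seed=0):
    """Single-state Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {action: 0.0 for action in ACTIONS}
    attacks = 0
    for _ in range(steps):
        # Explore with probability epsilon, otherwise act greedily.
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)
        else:
            action = max(ACTIONS, key=q.get)
        reward = sample_reward(action, toxicity, mimic_fraction, rng)
        # Temporal difference update; the next state is the same single state.
        td_error = reward + gamma * max(q.values()) - q[action]
        q[action] += alpha * td_error
        attacks += action == "attack"
    return q, attacks / steps


if __name__ == "__main__":
    for mimics in (0.0, 0.5, 1.0):
        q, attack_rate = run_q_learning(toxicity=2.0, mimic_fraction=mimics)
        print(mimics, round(attack_rate, 2), q)
```

Under these assumed payoffs, a highly toxic prey with no mimics drives the learned value of attacking below that of the alternative food, so attacks occur mostly through exploration; when every aposematic-looking item is a palatable mimic, the ordering reverses. Setting `epsilon` to 0 removes exploration in this sketch, which is one simple way to probe the role the abstract assigns to exploring the action space.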

Publisher
Database: Elsevier - ScienceDirect
Journal: Journal of Theoretical Biology - Volume 340, 7 January 2014, Pages 11–16