دانلود رایگان مقاله: یک مدل برای تکامل یادگیری تقویت در بازی های نوسان

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
2416296	1552224	2015	28 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

A model for the evolution of reinforcement learning in fluctuating games

ترجمه فارسی عنوان

یک مدل برای تکامل یادگیری تقویت در بازی های نوسان

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

تکامل شناخت، قوانین یادگیری پایدار تکاملی، اکتشاف-استثمار تجارت، بازی های تکراری تعاملات اجتماعی، یادگیری آزمایشی و خطا

Repeated games - بازی های تکراری Social interactions - تعاملات اجتماعی Trial-and-error learning - یادگیری آزمایشی و خطا

موضوعات مرتبط

علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک علوم دامی و جانورشناسی

پیش نمایش مقاله

یک مدل برای تکامل یادگیری تقویت در بازی های نوسان

چکیده انگلیسی

• We model natural selection on two learning rules in simple social interactions.
• We compare standard trial and error to learning based on hypothetical payoffs.
• We find that hypothetical reinforcement learning is not always selected for.
• Hypothetical reinforcement learning produces rational behaviour in one-shot games.
• Trial and error can prevail and establish cooperation in the Prisoner's Dilemma.

Many species are able to learn to associate behaviours with rewards as this gives fitness advantages in changing environments. Social interactions between population members may, however, require more cognitive abilities than simple trial-and-error learning, in particular the capacity to make accurate hypotheses about the material payoff consequences of alternative action combinations. It is unclear in this context whether natural selection necessarily favours individuals to use information about payoffs associated with nontried actions (hypothetical payoffs), as opposed to simple reinforcement of realized payoff. Here, we develop an evolutionary model in which individuals are genetically determined to use either trial-and-error learning or learning based on hypothetical reinforcements, and ask what is the evolutionarily stable learning rule under pairwise symmetric two-action stochastic repeated games played over the individual's lifetime. We analyse through stochastic approximation theory and simulations the learning dynamics on the behavioural timescale, and derive conditions where trial-and-error learning outcompetes hypothetical reinforcement learning on the evolutionary timescale. This occurs in particular under repeated cooperative interactions with the same partner. By contrast, we find that hypothetical reinforcement learners tend to be favoured under random interactions, but stable polymorphisms can also obtain where trial-and-error learners are maintained at a low frequency. We conclude that specific game structures can select for trial-and-error learning even in the absence of costs of cognition, which illustrates that cost-free increased cognition can be counterselected under social interactions.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Animal Behaviour - Volume 104, June 2015, Pages 87–114

نویسندگان

Slimane Dridi, Laurent Lehmann,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : یک مدل برای تکامل یادگیری تقویت در بازی های نوسان

دسترسی سریع

ارتباط

English Website