Learning to represent reward structure: A key to adapting to complex environments

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
4351575	1615307	2012	7 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Value - ارزش Salience - اهمیت Decision - تصمیم گیری Dopamine - دوپامین Structure - ساختار یا سازه Reward - پاداش Reinforcement learning - یادگیری تقویتی

موضوعات مرتبط

علوم زیستی و بیوفناوری علم عصب شناسی علوم اعصاب (عمومی)

پیش نمایش صفحه اول مقاله

Learning to represent reward structure: A key to adapting to complex environments

چکیده انگلیسی

Predicting outcomes is a critical ability of humans and animals. The dopamine reward prediction error hypothesis, the driving force behind the recent progress in neural “value-based” decision making, states that dopamine activity encodes the signals for learning in order to predict a reward, that is, the difference between the actual and predicted reward, called the reward prediction error. However, this hypothesis and its underlying assumptions limit the prediction and its error as reactively triggered by momentary environmental events. Reviewing the assumptions and some of the latest findings, we suggest that the internal state representation is learned to reflect the environmental reward structure, and we propose a new hypothesis – the dopamine reward structural learning hypothesis – in which dopamine activity encodes multiplex signals for learning in order to represent reward structure in the internal state, leading to better reward prediction.

► Learning the reward structure is indispensable for learning the reward prediction.
► Learning the reward structure in the internal state yield better reward prediction.
► We propose a new hypothesis: the dopamine reward structural learning hypothesis.
► DA activity encodes multiplexed learning signals for the structure and prediction.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neuroscience Research - Volume 74, Issues 3–4, December 2012, Pages 177–183

نویسندگان

Hiroyuki Nakahara, Okihide Hikosaka,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Learning to represent reward structure: A key to adapting to complex environments

دسترسی سریع

ارتباط

English Website