Transient and asymptotic dynamics of reinforcement learning in games

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
5072650	1373512	2007	19 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Slow learning C73 Stochastic approximation - تقریبی تصادفی Reinforcement learning - یادگیری تقویتی Learning in games - یادگیری در بازی ها

موضوعات مرتبط

علوم انسانی و اجتماعی اقتصاد، اقتصادسنجی و امور مالی اقتصاد و اقتصادسنجی

پیش نمایش صفحه اول مقاله

Transient and asymptotic dynamics of reinforcement learning in games

چکیده انگلیسی

Reinforcement learners tend to repeat actions that led to satisfactory outcomes in the past, and avoid choices that resulted in unsatisfactory experiences. This behavior is one of the most widespread adaptation mechanisms in nature. In this paper we fully characterize the dynamics of one of the best known stochastic models of reinforcement learning [Bush, R., Mosteller, F., 1955. Stochastic Models of Learning. Wiley & Sons, New York] for 2-player 2-strategy games. We also provide some extensions for more general games and for a wider class of learning algorithms. Specifically, it is shown that the transient dynamics of Bush and Mosteller's model can be substantially different from its asymptotic behavior. It is also demonstrated that in general-and in sharp contrast to other reinforcement learning models in the literature-the asymptotic dynamics of Bush and Mosteller's model cannot be approximated using the continuous time limit version of its expected motion.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Games and Economic Behavior - Volume 61, Issue 2, November 2007, Pages 259-276

نویسندگان

Luis R. Izquierdo, Segismundo S. Izquierdo, Nicholas M. Gotts, J. Gary Polhill,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Transient and asymptotic dynamics of reinforcement learning in games

دسترسی سریع

ارتباط

English Website