The lagging anchor model for game learning-a solution to the Crawford puzzle

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
10437781	912430	2005	17 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Convergence - همگرایی Reinforcement learning - یادگیری تقویتی Learning in games - یادگیری در بازی ها

موضوعات مرتبط

علوم انسانی و اجتماعی اقتصاد، اقتصادسنجی و امور مالی اقتصاد و اقتصادسنجی

پیش نمایش صفحه اول مقاله

The lagging anchor model for game learning-a solution to the Crawford puzzle

چکیده انگلیسی

In matrix games with fully mixed solutions, simultaneous gradient ascent by both players does not converge, a fact known as the Crawford puzzle. We suggest the lagging anchor learning model, which we prove to give convergence, as a solution to this puzzle. Our learning model can be viewed as a reinforcement learning process where the players perform relatively little computation. We compare our learning model with other published solutions to the puzzle. We also prove a generalization of the Crawford puzzle by identifying a broad class of learning rules that cannot produce exponential stability of solution points.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Economic Behavior & Organization - Volume 57, Issue 3, July 2005, Pages 287-303

نویسندگان

F.A. Dahl,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

The lagging anchor model for game learning-a solution to the Crawford puzzle

دسترسی سریع

ارتباط

English Website