Article code | Journal code | Publication year | English article |
---|---|---|---|
10437781 | 912430 | 2005 | 17-page PDF |
English title of the ISI article
The lagging anchor model for game learning - a solution to the Crawford puzzle
Related topics
Humanities and Social Sciences
Economics, Econometrics and Finance
Economics and Econometrics
English abstract
In matrix games with fully mixed solutions, simultaneous gradient ascent by both players does not converge, a fact known as the Crawford puzzle. We suggest the lagging anchor learning model, which we prove to give convergence, as a solution to this puzzle. Our learning model can be viewed as a reinforcement learning process where the players perform relatively little computation. We compare our learning model with other published solutions to the puzzle. We also prove a generalization of the Crawford puzzle by identifying a broad class of learning rules that cannot produce exponential stability of solution points.
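The non-convergence claim can be illustrated with a minimal sketch, which is not the paper's own formulation. It assumes matching pennies as the matrix game, an Euler-discretized gradient step on each player's probability of playing heads, and a hypothetical anchor term that pulls each strategy toward a slowly updated copy of itself. The abstract does not state the lagging anchor update rule, so the function name `simulate`, the `anchor_strength` term, and all parameter values are illustrative assumptions.

```python
import math


def clip01(v):
    """Keep a probability inside [0, 1]."""
    return min(1.0, max(0.0, v))


def simulate(steps=1000, lr=0.01, anchor_strength=0.0, p0=0.6, q0=0.5):
    """Return the final distance from the mixed solution (0.5, 0.5).

    p and q are the probabilities that player 1 and player 2 play heads in
    matching pennies, where player 1's expected payoff is (2p - 1)(2q - 1)
    and player 2 receives the negative of it.
    anchor_strength = 0 gives plain simultaneous gradient ascent.
    anchor_strength > 0 adds an ASSUMED anchor term (not taken from the
    abstract): each strategy is pulled toward an anchor that itself
    drifts slowly toward the strategy.
    """
    p, q = p0, q0
    anchor_p, anchor_q = p0, q0
    for _ in range(steps):
        # Each player's gradient of its own expected payoff.
        grad_p = 2.0 * (2.0 * q - 1.0)
        grad_q = -2.0 * (2.0 * p - 1.0)
        # Simultaneous update: both players move using the old (p, q).
        new_p = p + lr * (grad_p + anchor_strength * (anchor_p - p))
        new_q = q + lr * (grad_q + anchor_strength * (anchor_q - q))
        # The anchors lag behind the strategies.
        anchor_p += lr * anchor_strength * (p - anchor_p)
        anchor_q += lr * anchor_strength * (q - anchor_q)
        p, q = clip01(new_p), clip01(new_q)
    return math.hypot(p - 0.5, q - 0.5)


if __name__ == "__main__":
    print("plain gradient ascent:", simulate(anchor_strength=0.0))
    print("with assumed anchor term:", simulate(anchor_strength=1.0))
```

With `anchor_strength=0.0` the discretized trajectory circles the mixed solution and drifts away from it, while a positive `anchor_strength` in this sketch damps the rotation so the strategies settle near (0.5, 0.5). This is only a toy two-action case and says nothing about the paper's general proof.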
Publisher
Database: Elsevier - ScienceDirect
Journal: Journal of Economic Behavior & Organization - Volume 57, Issue 3, July 2005, Pages 287-303
Authors
F.A. Dahl,