Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
696628	890343	2011	14 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton–Jacobi equations

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Coupled Riccati equations Multi-player games Nash equilibrium - تعادل نش Persistence of excitation - پایداری تحریک Adaptive optimal control - کنترل بهینه سازگار

موضوعات مرتبط

مهندسی و علوم پایه سایر رشته های مهندسی کنترل و سیستم های مهندسی

پیش نمایش صفحه اول مقاله

Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton–Jacobi equations

چکیده انگلیسی

In this paper we present an online adaptive control algorithm based on policy iteration reinforcement learning techniques to solve the continuous-time (CT) multi player non-zero-sum (NZS) game with infinite horizon for linear and nonlinear systems. NZS games allow for players to have a cooperative team component and an individual selfish component of strategy. The adaptive algorithm learns online the solution of coupled Riccati equations and coupled Hamilton–Jacobi equations for linear and nonlinear systems respectively. This adaptive control method finds in real-time approximations of the optimal value and the NZS Nash-equilibrium, while also guaranteeing closed-loop stability. The optimal-adaptive algorithm is implemented as a separate actor/critic parametric network approximator structure for every player, and involves simultaneous continuous-time adaptation of the actor/critic networks. A persistence of excitation condition is shown to guarantee convergence of every critic to the actual optimal value function for that player. A detailed mathematical analysis is done for 2-player NZS games. Novel tuning algorithms are given for the actor/critic networks. The convergence to the Nash equilibrium is proven and stability of the system is also guaranteed. This provides optimal adaptive control solutions for both non-zero-sum games and their special case, the zero-sum games. Simulation examples show the effectiveness of the new algorithm.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Automatica - Volume 47, Issue 8, August 2011, Pages 1556–1569

نویسندگان

Kyriakos G. Vamvoudakis, Frank L. Lewis,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton–Jacobi equations

دسترسی سریع

ارتباط

English Website