دانلود رایگان مقاله: کنترل تقریبی بهینه مبتنی بر داده ها برای بازی های غیرضروری سیستم های چند نفره با استفاده از برنامه ریزی پویای تطبیقی

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
6864813	1439552	2018	28 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Data-based approximate optimal control for nonzero-sum games of multi-player systems using adaptive dynamic programming

ترجمه فارسی عنوان

کنترل تقریبی بهینه مبتنی بر داده ها برای بازی های غیرضروری سیستم های چند نفره با استفاده از برنامه ریزی پویای تطبیقی

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

تقویت یادگیری، برنامه ریزی پویا سازگار، مبتنی بر داده ها، شبکه های عصبی،

Data-based adaptive dynamic programming - برنامه ریزی پویا تطبیقی Neural networks - شبکه های عصبی Reinforcement learning - یادگیری تقویتی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

کنترل تقریبی بهینه مبتنی بر داده ها برای بازی های غیرضروری سیستم های چند نفره با استفاده از برنامه ریزی پویای تطبیقی

چکیده انگلیسی

This paper investigates the non-zerosum game issue for unknown nonlinear systems with multi-player by using data-based adaptive dynamic programming (ADP) methods. It is known that the traditional ADP approaches require accurate system models to compute the solutions of non-zerosum games. However, for the practical nonlinear systems, system models are difficult to be obtained and thus these methods will be invalid. To overcome this difficulty, we propose two neural-network-based identification schemes. Combined with the identification results, we design a data-based actor-critic algorithm to learn and approximate the optimal solutions in real time. Subsequently, in order to reduce the computation burden of dual network algorithm, we develop a single network one. To test the feasibility and validity of our schemes, we provide two simulation examples including a linear one and a nonlinear one.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 275, 31 January 2018, Pages 192-199

نویسندگان

He Jiang, Huaguang Zhang, Geyang Xiao, Xiaohong Cui,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : کنترل تقریبی بهینه مبتنی بر داده ها برای بازی های غیرضروری سیستم های چند نفره با استفاده از برنامه ریزی پویای تطبیقی

دسترسی سریع

ارتباط

English Website