Article code | Journal code | Publication year | English article | Full-text version
8960119 | 1646381 | 2018 | 31-page PDF | Free download
English title of the ISI article
Linear quadratic tracking control of unknown discrete-time systems using value iteration algorithm
Persian translation of the title
کنترل ردیابی خطی سیستم های زمان گسسته ناشناخته با استفاده از الگوریتم تکرار ارزش
Keywords
Adaptive dynamic programming, linear quadratic tracking, reinforcement learning, value iteration
Related topics
Engineering and Basic Sciences > Computer Engineering > Artificial Intelligence
English abstract
This paper proposes an optimal tracking control scheme that solves the infinite-horizon linear quadratic tracking (LQT) problem using an iterative adaptive dynamic programming (ADP) algorithm. The reference trajectory is assumed to be produced by a linear command generator. First, via a system transformation, an augmented system composed of the controlled system and the command generator is constructed. The Bellman equation is then derived for the transformed system, with a discount factor in the cost function. To avoid requiring knowledge of the system dynamics, the iterative ADP algorithm is introduced to solve the Bellman equation, and its convergence is analyzed. A novel approach based on controllability and observability analysis is presented to show the stability of the tracking error. To facilitate implementation of this iterative approach, three neural networks (NNs) are employed as parametric structures to identify the unknown system dynamics, approximate the performance function, and search for the control policy, respectively. Finally, a simulation example is included to verify the effectiveness of the proposed scheme.
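The augmented-system and discounted value-iteration ideas in the abstract can be illustrated with a minimal sketch. It is not the paper's method: the paper treats the dynamics as unknown and uses three neural networks, whereas this sketch assumes the matrices are known and iterates the discounted Riccati recursion directly. The function name `lqt_value_iteration`, the symbols `A`, `B`, `C`, `F`, `Q`, `R`, `gamma`, and all numeric values are assumptions chosen for illustration only.

```python
import numpy as np


def lqt_value_iteration(A, B, C, F, Q, R, gamma, tol=1e-10, max_iters=10_000):
    """Model-based value iteration for a discounted LQT problem (illustrative).

    Assumed setup: x[k+1] = A x[k] + B u[k], y[k] = C x[k], r[k+1] = F r[k],
    cost = sum_k gamma^k * ((y - r)' Q (y - r) + u' R u).
    """
    n, m = B.shape
    p = F.shape[0]
    # Augmented state X = [x; r] with dynamics X[k+1] = T X[k] + B1 u[k].
    T = np.block([[A, np.zeros((n, p))], [np.zeros((p, n)), F]])
    B1 = np.vstack([B, np.zeros((p, m))])
    # Tracking error is e = C x - r = C1 X, so the state weight is C1' Q C1.
    C1 = np.hstack([C, -np.eye(p)])
    Q1 = C1.T @ Q @ C1

    def gain(P):
        # Greedy policy u = -K X for the value function V(X) = X' P X.
        G = R + gamma * B1.T @ P @ B1
        return gamma * np.linalg.solve(G, B1.T @ P @ T)

    P = np.zeros((n + p, n + p))  # value iteration starts from V_0 = 0
    for _ in range(max_iters):
        K = gain(P)
        P_next = Q1 + gamma * T.T @ P @ T - gamma * T.T @ P @ B1 @ K
        P_next = 0.5 * (P_next + P_next.T)  # keep P numerically symmetric
        converged = np.max(np.abs(P_next - P)) < tol
        P = P_next
        if converged:
            break
    return P, gain(P), T, B1


# Hypothetical toy data: a stable two-state plant tracking a constant reference.
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
C = np.array([[1.0, 0.0]])
F = np.array([[1.0]])  # command generator r[k+1] = r[k]
Q = np.array([[10.0]])
R = np.array([[1.0]])
gamma = 0.8

P, K, T, B1 = lqt_value_iteration(A, B, C, F, Q, R, gamma)
print("feedback gain K on [x; r]:", K)
```

The resulting control law is u_k = -K [x_k; r_k], so the gain acts on both the plant state and the reference. With a discount factor below one, the converged P is a fixed point of the discounted recursion, and the scaled closed-loop matrix sqrt(gamma) (T - B1 K) is expected to be stable under the usual stabilizability and detectability conditions.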
Publisher
Database: Elsevier - ScienceDirect
Journal: Neurocomputing - Volume 314, 7 November 2018, Pages 86-93