Value iteration based integral reinforcement learning approach for Hâ controller design of continuous-time nonlinear systems

Article ID	Journal	Published Year	Pages	File Type
6864431	Neurocomputing	2018	9 Pages	PDF

Abstract

In this paper, a novel integral reinforcement learning approach is developed based on value iteration (VI) for designing the Hâ controller of continuous-time (CT) nonlinear systems. First, the VI learning mechanism is introduced to solve the zero-sum game problems, which is equivalent to the Hamilton-Jacobi-Isaacs (HJI) equation arising in Hâ control problems. Since the proposed method is based on VI learning mechanism, it does not require the admissible control for the implementation, and thus satisfies a more general initial condition than the works based on policy iteration (PI). The iterative property of the value function is analysed with an arbitrary initial positive function, and the Hâ controller can be derived as the iteration converges. For the implementation of the proposed method, three neural networks are introduced to approximate the iterative value function, the iterative control policy and the iterative disturbance policy, respectively. To verify the effectiveness of the VI based method, a linear case and a nonlinear case are presented, respectively.

Keywords

Value iteration Continuous-time systems H∞ control Reinforcement learning