Ferdowsi University of Mashhad | Information Center and Central Library

Online Concurrent Reinforcement Learning Algorithm to Solve Two-player Zero-sum Games for Partially-unknown Nonlinear Continuous-time Systems

Author(s): Sholeh Yasini, Ali Karimpour, Mohammad Bagher Naghibi Sistani, Hamidreza Modares
Year: 2014
Abstract: Online adaptive optimal control methods based on reinforcement learning algorithms typically need to check the persistence of excitation condition, which must hold a priori for the algorithm to converge. However, this condition is often infeasible to implement or monitor online. This paper proposes an online concurrent reinforcement learning algorithm (CRLA) based on neural networks (NNs) to solve the H∞ control problem of partially unknown continuous-time systems, in which the need for the persistence of excitation condition is relaxed by using the idea of concurrent learning. First, the H∞ control problem is formulated as a two-player zero-sum game, and then the online CRLA is employed to obtain an approximation of the optimal value and the Nash equilibrium of the game. The proposed algorithm is implemented on an actor–critic–disturbance NN approximator structure to obtain the solution of the Hamilton–Jacobi–Isaacs equation online, forward in time. During the implementation of the algorithm, the control input, acting as one player, attempts to find the optimal control, while the other player, the disturbance, tries to produce the worst-case possible disturbance. Novel update laws are derived for adaptation of the critic and actor NN weights. The stability of the closed-loop system is guaranteed using the Lyapunov technique, and convergence to the Nash solution of the game is obtained. Simulation results show the effectiveness of the proposed method.
URI: https://libsearch.um.ac.ir:443/fum/handle/fum/3351422
Keyword(s): H∞ control, two-player zero-sum games, neural networks, online concurrent reinforcement learning algorithm
Collections:
  • ProfDoc
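The two-player zero-sum game formulation described in the abstract can be illustrated, in a deliberately simplified form, by policy iteration on a scalar game algebraic Riccati equation. This is a model-based sketch with hypothetical plant and cost numbers (the values of a, b, d, q, r, and gamma below are invented for illustration); the paper itself solves the partially unknown case online with NN actor–critic–disturbance approximators, which this sketch does not reproduce.

```python
# Scalar linear plant x_dot = a*x + b*u + d*w with zero-sum cost
# integral of (q*x^2 + r*u^2 - gamma^2 * w^2) dt.
# The control u minimizes the cost; the disturbance w maximizes it.
# All numbers are hypothetical, chosen only so a positive solution exists.
a, b, d = -1.0, 1.0, 1.0
q, r, gamma = 1.0, 1.0, 2.0

def solve_zero_sum_game(p0=0.0, iters=50):
    """Policy iteration on the scalar game algebraic Riccati equation
    2*a*p + q - (b**2/r)*p**2 + (d**2/gamma**2)*p**2 = 0,
    whose solution p gives the game value V(x) = p*x**2."""
    p = p0
    for _ in range(iters):
        ku = -(b / r) * p           # minimizing player's feedback gain
        kw = (d / gamma**2) * p     # maximizing player's feedback gain
        acl = a + b * ku + d * kw   # closed-loop drift under both policies
        # Policy evaluation: 2*acl*p_new + q + r*ku**2 - gamma**2*kw**2 = 0
        p = -(q + r * ku**2 - gamma**2 * kw**2) / (2 * acl)
    return p

p = solve_zero_sum_game()
# Residual of the game Riccati equation; near zero at the saddle point.
residual = 2*a*p + q - (b**2 / r) * p**2 + (d**2 / gamma**2) * p**2
```

At the saddle point the minimizing control is u = -(b/r)*p*x and the worst-case disturbance is w = (d/gamma**2)*p*x; this saddle-point structure is what the actor and disturbance networks in the paper approximate, without requiring the plant model.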

Full item record:
contributor author: Sholeh Yasini
contributor author: Ali Karimpour
contributor author: Mohammad Bagher Naghibi Sistani
contributor author: Hamidreza Modares
date accessioned: 2020-06-06T13:21:13Z
date available: 2020-06-06T13:21:13Z
date issued: 2014
identifier uri: https://libsearch.um.ac.ir:443/fum/handle/fum/3351422?locale-attribute=en
language: English
title: Online Concurrent Reinforcement Learning Algorithm to Solve Two-player Zero-sum Games for Partially-unknown Nonlinear Continuous-time Systems
type: Journal Paper
content type: External Fulltext
subject keywords: H∞ control; two-player zero-sum games; neural networks; online concurrent reinforcement learning algorithm
journal title: International Journal of Adaptive Control and Signal Processing
pages: 1-21
journal volume: 0
journal issue: 0
identifier link: https://profdoc.um.ac.ir/paper-abstract-1044410.html
identifier articleid: 1044410