Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
Author:
, , , ,Year
: 2014
Abstract: In this paper, an integral reinforcement learning (IRL) algorithm on an actor–critic structure is developed
to learn online the solution to the Hamilton–Jacobi–Bellman equation for partially-unknown constrainedinput
systems. The technique of experience replay is used to update the critic weights to solve an
IRL Bellman equation. This means, unlike existing reinforcement learning algorithms, recorded past
experiences are used concurrently with current data for adaptation of the critic weights. It is shown that
using this technique, instead of the traditional persistence of excitation condition which is often difficult
or impossible to verify online, an easy-to-check condition on the richness of the recorded data is sufficient
to guarantee convergence to a near-optimal control law. Stability of the proposed feedback control law is
shown and the effectiveness of the proposed method is illustrated with simulation examples
to learn online the solution to the Hamilton–Jacobi–Bellman equation for partially-unknown constrainedinput
systems. The technique of experience replay is used to update the critic weights to solve an
IRL Bellman equation. This means, unlike existing reinforcement learning algorithms, recorded past
experiences are used concurrently with current data for adaptation of the critic weights. It is shown that
using this technique, instead of the traditional persistence of excitation condition which is often difficult
or impossible to verify online, an easy-to-check condition on the richness of the recorded data is sufficient
to guarantee convergence to a near-optimal control law. Stability of the proposed feedback control law is
shown and the effectiveness of the proposed method is illustrated with simulation examples
Keyword(s): Integral reinforcement learning
Experience replay
Optimal control
Neural networks
Input constraints
Collections
:
-
Statistics
Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
Show full item record
contributor author | حمید رضا مدرّس | en |
contributor author | F. L. Lewis | en |
contributor author | محمدباقر نقیبی سیستانی | en |
contributor author | Hamidreza Modares | fa |
contributor author | Mohammad Bagher Naghibi Sistani | fa |
date accessioned | 2020-06-06T13:17:09Z | |
date available | 2020-06-06T13:17:09Z | |
date issued | 2014 | |
identifier uri | http://libsearch.um.ac.ir:80/fum/handle/fum/3349005?locale-attribute=en | |
description abstract | In this paper, an integral reinforcement learning (IRL) algorithm on an actor–critic structure is developed to learn online the solution to the Hamilton–Jacobi–Bellman equation for partially-unknown constrainedinput systems. The technique of experience replay is used to update the critic weights to solve an IRL Bellman equation. This means, unlike existing reinforcement learning algorithms, recorded past experiences are used concurrently with current data for adaptation of the critic weights. It is shown that using this technique, instead of the traditional persistence of excitation condition which is often difficult or impossible to verify online, an easy-to-check condition on the richness of the recorded data is sufficient to guarantee convergence to a near-optimal control law. Stability of the proposed feedback control law is shown and the effectiveness of the proposed method is illustrated with simulation examples | en |
language | English | |
title | Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems | en |
type | Journal Paper | |
contenttype | External Fulltext | |
subject keywords | Integral reinforcement learning Experience replay Optimal control Neural networks Input constraints | en |
journal title | Automatica | fa |
pages | 193-202 | |
journal volume | 50 | |
journal issue | 1 | |
identifier link | https://profdoc.um.ac.ir/paper-abstract-1040139.html | |
identifier articleid | 1040139 |