Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems

نویسنده:

سال

: 2014

چکیده: In this paper, an integral reinforcement learning (IRL) algorithm on an actor–critic structure is developed

to learn online the solution to the Hamilton–Jacobi–Bellman equation for partially-unknown constrainedinput

systems. The technique of experience replay is used to update the critic weights to solve an

IRL Bellman equation. This means, unlike existing reinforcement learning algorithms, recorded past

experiences are used concurrently with current data for adaptation of the critic weights. It is shown that

using this technique, instead of the traditional persistence of excitation condition which is often difficult

or impossible to verify online, an easy-to-check condition on the richness of the recorded data is sufficient

to guarantee convergence to a near-optimal control law. Stability of the proposed feedback control law is

shown and the effectiveness of the proposed method is illustrated with simulation examples

یو آر آی: https://libsearch.um.ac.ir:443/fum/handle/fum/3349005

کلیدواژه(گان): Integral reinforcement learning

Experience replay

Optimal control

Neural networks

Input constraints

کالکشن :

ProfDoc

نمایش متادیتا پنهان کردن متادیتا
آمار بازدید

Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems

contributor author	حمید رضا مدرّس	en
contributor author	F. L. Lewis	en
contributor author	محمدباقر نقیبی سیستانی	en
contributor author	Hamidreza Modares	fa
contributor author	Mohammad Bagher Naghibi Sistani	fa
date accessioned	2020-06-06T13:17:09Z
date available	2020-06-06T13:17:09Z
date issued	2014
identifier uri	https://libsearch.um.ac.ir:443/fum/handle/fum/3349005
description abstract	In this paper, an integral reinforcement learning (IRL) algorithm on an actor–critic structure is developed to learn online the solution to the Hamilton–Jacobi–Bellman equation for partially-unknown constrainedinput systems. The technique of experience replay is used to update the critic weights to solve an IRL Bellman equation. This means, unlike existing reinforcement learning algorithms, recorded past experiences are used concurrently with current data for adaptation of the critic weights. It is shown that using this technique, instead of the traditional persistence of excitation condition which is often difficult or impossible to verify online, an easy-to-check condition on the richness of the recorded data is sufficient to guarantee convergence to a near-optimal control law. Stability of the proposed feedback control law is shown and the effectiveness of the proposed method is illustrated with simulation examples	en
language	English
title	Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems	en
type	Journal Paper
contenttype	External Fulltext
subject keywords	Integral reinforcement learning Experience replay Optimal control Neural networks Input constraints	en
journal title	Automatica	fa
pages	193-202
journal volume	50
journal issue	1
identifier link	https://profdoc.um.ac.ir/paper-abstract-1040139.html
identifier articleid	1040139