Integral reinforcement learning with explorations for continuous-time nonlinear systems

Open Access

1 June 2012

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Abstract

This paper studies integral reinforcement learning (I-RL) for input-affine continuous-time (CT) nonlinear systems in which a known time-varying signal, called an exploration, is injected through the control input. First, we propose a modified I-RL method that effectively eliminates the effects of the exploration on the algorithm. Next, based on this result, an actor-critic I-RL technique is presented for the same class of nonlinear systems with completely unknown dynamics. Finally, a least-squares implementation with exact parameterizations is presented for each proposed method; each can be solved under the given persistently exciting (PE) conditions. A simulation example is given to verify the effectiveness of the proposed methods.
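
The idea of removing the exploration's effect from a least-squares I-RL step can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes a scalar linear plant x' = a*x + b*(u + e(t)) as a special case of an input-affine system, a fixed policy u = -gain*x, a quadratic cost q*x^2 + r*u^2, and a known input gain b. All names (collect_intervals, estimate_p, explore, gain) are hypothetical. For the value function V(x) = p*x^2, integrating dV/dt along the explored trajectory gives p*(x(t)^2 - x(t+T)^2 + 2*b*∫x*e dτ) = ∫(q*x^2 + r*u^2) dτ, so the extra term 2*b*∫x*e dτ is what compensates for the exploration.

```python
import numpy as np

def collect_intervals(a, b, gain, q, r, explore, x0, T, n_intervals, steps=200):
    """Simulate x' = a*x + b*(u + e(t)) with u = -gain*x and return, for each
    interval of length T: (x_start, x_end, integral of cost, integral of x*e)."""
    def deriv(t, z):
        x = z[0]
        u = -gain * x
        e = explore(t)
        return np.array([a * x + b * (u + e),   # plant state
                         q * x**2 + r * u**2,   # running cost
                         x * e])                # exploration cross term

    h = T / steps
    t, x = 0.0, x0
    rows = []
    for _ in range(n_intervals):
        z = np.array([x, 0.0, 0.0])             # reset the two integrals
        for _ in range(steps):                  # classical RK4 step
            k1 = deriv(t, z)
            k2 = deriv(t + h / 2, z + h / 2 * k1)
            k3 = deriv(t + h / 2, z + h / 2 * k2)
            k4 = deriv(t + h, z + h * k3)
            z = z + h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
            t += h
        rows.append((x, z[0], z[1], z[2]))
        x = z[0]
    return np.array(rows)

def estimate_p(rows, b, compensate):
    """Least-squares estimate of p in V(x) = p*x^2 from interval data."""
    x_start, x_end, cost, xe = rows.T
    phi = x_start**2 - x_end**2                 # standard I-RL regressor
    if compensate:
        phi = phi + 2.0 * b * xe                # remove the exploration's effect
    p, *_ = np.linalg.lstsq(phi[:, None], cost, rcond=None)
    return float(p[0])

# Assumed example values, chosen only for illustration.
a, b, gain, q, r = 1.0, 1.0, 3.0, 1.0, 1.0
explore = lambda t: np.sin(t) + 0.5 * np.sin(3.1 * t)   # known exploration signal
rows = collect_intervals(a, b, gain, q, r, explore, x0=1.0, T=0.1, n_intervals=50)

p_true = (q + r * gain**2) / (2.0 * (b * gain - a))     # closed-form value for this policy
p_comp = estimate_p(rows, b, compensate=True)
p_naive = estimate_p(rows, b, compensate=False)
print(p_true, p_comp, p_naive)
```

In this sketch the compensated estimate should match the closed-form value up to integration error, while the uncompensated regression is biased by the exploration. The compensation here relies on knowing b; the abstract's actor-critic technique for completely unknown dynamics is not reproduced.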

Cited by 11 articles