Finite-Approximation-Error-Based Optimal Control Approach for Discrete-Time Nonlinear Systems

Top Cited Papers

7 March 2013

journal article
research article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Cybernetics

Vol. 43 (2), 779-789
https://doi.org/10.1109/tsmcb.2012.2216523

Abstract

In this paper, a new iterative adaptive dynamic programming (ADP) algorithm is developed to solve optimal control problems for infinite-horizon discrete-time nonlinear systems with finite approximation errors. The idea is to use an iterative ADP algorithm to obtain the iterative control law that makes the iterative performance index function reach the optimum. When the iterative control law and the iterative performance index function in each iteration cannot be accurately obtained, the convergence conditions of the iterative ADP algorithm are obtained. When convergence conditions are satisfied, it is shown that the iterative performance index functions can converge to a finite neighborhood of the greatest lower bound of all performance index functions under some mild assumptions. Neural networks are used to approximate the performance index function and compute the optimal control policy, respectively, for facilitating the implementation of the iterative ADP algorithm. Finally, two simulation examples are given to illustrate the performance of the present method.

Keywords

This publication has 30 references indexed in Scilit:

Adaptive dynamic programming with stable value iteration algorithm for discrete-time nonlinear systems
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2012
An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
Automatica, 2010
Bio-inspired Algorithms for Autonomous Deployment and Localization of Sensor Nodes
IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), 2010
Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data
IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2010
Reinforcement learning and adaptive dynamic programming for feedback control
IEEE Circuits and Systems Magazine, 2009
Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions
Neurocomputing, 2009
Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks
IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2008
A Novel Infinite-Time Optimal Tracking Control Scheme for a Class of Discrete-Time Nonlinear Systems via the Greedy HDP Iteration Algorithm
IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2008
Higher Level Application of ADP: A Next Phase for the Control Field?
IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2008
Discrete-Time Adaptive Dynamic Programming using Wavelet Basis Function Neural Networks
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2007

Cited by 249 articles