Finite-Approximation-Error-Based Optimal Control Approach for Discrete-Time Nonlinear Systems
Top Cited Papers
- 7 March 2013
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Cybernetics
- Vol. 43 (2), 779-789
- https://doi.org/10.1109/tsmcb.2012.2216523
Abstract
In this paper, a new iterative adaptive dynamic programming (ADP) algorithm is developed to solve optimal control problems for infinite-horizon discrete-time nonlinear systems with finite approximation errors. The idea is to use an iterative ADP algorithm to obtain the iterative control law that makes the iterative performance index function reach the optimum. When the iterative control law and the iterative performance index function in each iteration cannot be accurately obtained, the convergence conditions of the iterative ADP algorithm are obtained. When convergence conditions are satisfied, it is shown that the iterative performance index functions can converge to a finite neighborhood of the greatest lower bound of all performance index functions under some mild assumptions. Neural networks are used to approximate the performance index function and compute the optimal control policy, respectively, for facilitating the implementation of the iterative ADP algorithm. Finally, two simulation examples are given to illustrate the performance of the present method.Keywords
This publication has 30 references indexed in Scilit:
- Adaptive dynamic programming with stable value iteration algorithm for discrete-time nonlinear systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2012
- An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential gamesAutomatica, 2010
- Bio-inspired Algorithms for Autonomous Deployment and Localization of Sensor NodesIEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), 2010
- Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output DataIEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2010
- Reinforcement learning and adaptive dynamic programming for feedback controlIEEE Circuits and Systems Magazine, 2009
- Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functionsNeurocomputing, 2009
- Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural NetworksIEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2008
- A Novel Infinite-Time Optimal Tracking Control Scheme for a Class of Discrete-Time Nonlinear Systems via the Greedy HDP Iteration AlgorithmIEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2008
- Higher Level Application of ADP: A Next Phase for the Control Field?IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2008
- Discrete-Time Adaptive Dynamic Programming using Wavelet Basis Function Neural NetworksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007