Reinforcement learning and adaptive dynamic programming for feedback control

Top Cited Papers

Abstract

Living organisms learn by acting on their environment, observing the resulting reward stimulus, and adjusting their actions accordingly to improve the reward. This action-based or reinforcement learning can capture notions of optimal behavior occurring in natural systems. We describe mathematical formulations for reinforcement learning and a practical implementation method known as adaptive dynamic programming. These give us insight into the design of controllers for man-made engineered systems that both learn and exhibit optimal behavior.

Keywords

This publication has 60 references indexed in Scilit:

Iterative local dynamic programming
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2009
Adaptive optimal control for continuous-time linear systems based on policy iteration
Automatica, 2009
Dual heuristic programming excitation neurocontrol for generators in a multimachine power system
IEEE Transactions on Industry Applications, 2003
A neuro-dynamic programming approach to retailer inventory management
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Optimal feedback control as a theory of motor coordination
Nature Neuroscience, 2002
Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator
IEEE Transactions on Neural Networks, 2002
Efficient algorithms for globally optimal trajectories
IEEE Transactions on Automatic Control, 1995
Neural net robot controller with guaranteed tracking performance
IEEE Transactions on Neural Networks, 1995
Practical issues in temporal difference learning
Machine Learning, 1992
Learning to predict by the methods of temporal differences
Machine Learning, 1988

Cited by 1088 articles