Reinforcement learning and adaptive dynamic programming for feedback control
- 28 August 2009
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Circuits and Systems Magazine
- Vol. 9 (3), 32-50
- https://doi.org/10.1109/mcas.2009.933854
Abstract
Living organisms learn by acting on their environment, observing the resulting reward stimulus, and adjusting their actions accordingly to improve the reward. This action-based or reinforcement learning can capture notions of optimal behavior occurring in natural systems. We describe mathematical formulations for reinforcement learning and a practical implementation method known as adaptive dynamic programming. These give us insight into the design of controllers for man-made engineered systems that both learn and exhibit optimal behavior.
This publication has 60 references indexed in Scilit:
- Iterative local dynamic programming. Published by Institute of Electrical and Electronics Engineers (IEEE), 2009
- Adaptive optimal control for continuous-time linear systems based on policy iteration. Automatica, 2009
- Dual heuristic programming excitation neurocontrol for generators in a multimachine power system. IEEE Transactions on Industry Applications, 2003
- A neuro-dynamic programming approach to retailer inventory management. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
- Optimal feedback control as a theory of motor coordination. Nature Neuroscience, 2002
- Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator. IEEE Transactions on Neural Networks, 2002
- Efficient algorithms for globally optimal trajectories. IEEE Transactions on Automatic Control, 1995
- Neural net robot controller with guaranteed tracking performance. IEEE Transactions on Neural Networks, 1995
- Practical issues in temporal difference learning. Machine Learning, 1992
- Learning to predict by the methods of temporal differences. Machine Learning, 1988