Learning rate free reinforcement learning for real-time motion control using a value-gradient based policy
- 1 December 2014
- journal article
- Published by Elsevier BV in Mechatronics
- Vol. 24 (8), 966-974
- https://doi.org/10.1016/j.mechatronics.2014.05.007
Abstract
No abstract availableKeywords
This publication has 12 references indexed in Scilit:
- Efficient Model Learning Methods for Actor–Critic ControlIEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2011
- Experience Replay for Real-Time Reinforcement Learning ControlIEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), 2011
- Actor-Critic Control with Reference Model LearningIFAC Proceedings Volumes, 2011
- Real-time reinforcement learning by sequential Actor–Critics and experience replayNeural Networks, 2009
- Reinforcement learning and adaptive dynamic programming for feedback controlIEEE Circuits and Systems Magazine, 2009
- An analysis of reinforcement learning with function approximationPublished by Association for Computing Machinery (ACM) ,2008
- Technical Update: Least-Squares Temporal Difference LearningMachine Learning, 2002
- Reinforcement Learning in Continuous Time and SpaceNeural Computation, 2000
- Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic ProgrammingPublished by Elsevier BV ,1990
- A Stochastic Approximation MethodThe Annals of Mathematical Statistics, 1951