Learning rate free reinforcement learning for real-time motion control using a value-gradient based policy

Publisher Website

1 December 2014

journal article
Published by Elsevier BV in Mechatronics

Vol. 24 (8), 966-974
https://doi.org/10.1016/j.mechatronics.2014.05.007

Abstract

No abstract available

Keywords

This publication has 12 references indexed in Scilit:

Efficient Model Learning Methods for Actor–Critic Control
IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2011
Experience Replay for Real-Time Reinforcement Learning Control
IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), 2011
Actor-Critic Control with Reference Model Learning
IFAC Proceedings Volumes, 2011
Real-time reinforcement learning by sequential Actor–Critics and experience replay
Neural Networks, 2009
Reinforcement learning and adaptive dynamic programming for feedback control
IEEE Circuits and Systems Magazine, 2009
An analysis of reinforcement learning with function approximation
Published by Association for Computing Machinery (ACM) ,2008
Technical Update: Least-Squares Temporal Difference Learning
Machine Learning, 2002
Reinforcement Learning in Continuous Time and Space
Neural Computation, 2000
Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming
Published by Elsevier BV ,1990
A Stochastic Approximation Method
The Annals of Mathematical Statistics, 1951

Cited by 7 articles