To Discount or not to Discount in Reinforcement Learning: A Case Study Comparing R Learning and Q Learning
- 1 January 1994
- conference paper
- Published by Elsevier BV
Abstract
No abstract availableKeywords
This publication has 6 references indexed in Scilit:
- Computationally efficient adaptive control algorithms for Markov chainsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- A Reinforcement Learning Method for Maximizing Undiscounted RewardsPublished by Elsevier BV ,1993
- Automatic programming of behavior-based robots using reinforcement learningArtificial Intelligence, 1992
- Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic ProgrammingPublished by Elsevier BV ,1990
- Learning to predict by the methods of temporal differencesMachine Learning, 1988
- Neuronlike adaptive elements that can solve difficult learning control problemsIEEE Transactions on Systems, Man, and Cybernetics, 1983