On the Existence of Fixed Points for Approximate Value Iteration and Temporal-Difference Learning
- 1 June 2000
- journal article
- Published by Springer Science and Business Media LLC in Journal of Optimization Theory and Applications
- Vol. 105 (3), 589-608
- https://doi.org/10.1023/A:1004641123405
Abstract
No abstract availableKeywords
This publication has 6 references indexed in Scilit:
- A neuro-dynamic programming approach to retailer inventory managementPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Mean-Field Theory for Batched TD(λ)Neural Computation, 1997
- An analysis of temporal-difference learning with function approximationIEEE Transactions on Automatic Control, 1997
- Discrete Stochastic ProcessesPublished by Springer Science and Business Media LLC ,1996
- Adaptive Algorithms and Stochastic ApproximationsPublished by Springer Science and Business Media LLC ,1990
- Functional Approximations and Dynamic ProgrammingMathematical Tables and Other Aids to Computation, 1959