Sciweavers

NIPS
1996
15 years 28 days ago
Multidimensional Triangulation and Interpolation for Reinforcement Learning
Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...
Scott Davies
CORR
2010
Springer
14 years 11 months ago
Neuroevolutionary optimization
Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...
Eva Volná