Sciweavers

71 search results - page 15 / 15
» A Behavior Adaptation Algorithm based on Hierarchical Partia...
Sort
View
NIPS
1996
13 years 6 months ago
Multidimensional Triangulation and Interpolation for Reinforcement Learning
Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...
Scott Davies