Sciweavers

» Batch Reinforcement Learning with State Importance
ICML
1999
IEEE
14 years 5 months ago
Least-Squares Temporal Difference Learning
Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...
Justin A. Boyan
JETAI
2002
13 years 4 months ago
The interaction of representations and planning objectives for decision-theoretic planning tasks
We study decision-theoretic planning or reinforcement learning in the presence of traps such as steep slopes for outdoor robots or staircases for indoor robots. In this case, achi...
Sven Koenig, Yaxin Liu