Sciweavers

51 search results - page 11 / 11
» Exponentiated Gradient Methods for Reinforcement Learning
Sort
View
JETAI
2002
69views more  JETAI 2002»
13 years 5 months ago
The interaction of representations and planning objectives for decision-theoretic planning tasks
We study decision-theoretic planning or reinforcement learning in the presence of traps such as steep slopes for outdoor robots or staircases for indoor robots. In this case, achi...
Sven Koenig, Yaxin Liu