Sciweavers

179 search results - page 3 / 36
» Learning Relational Navigation Policies
Sort
View
NIPS
2003
14 years 11 months ago
Approximate Policy Iteration with a Policy Language Bias
We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual...
Alan Fern, Sung Wook Yoon, Robert Givan
76
Voted
ICML
2008
IEEE
15 years 10 months ago
Non-parametric policy gradients: a unified treatment of propositional and relational domains
Policy gradient approaches are a powerful instrument for learning how to interact with the environment. Existing approaches have focused on propositional and continuous domains on...
Kristian Kersting, Kurt Driessens
EPIA
2007
Springer
15 years 3 months ago
Generalization and Transfer Learning in Noise-Affected Robot Navigation Tasks
Abstract. When a robot learns to solve a goal-directed navigation task with reinforcement learning, the acquired strategy can usually exclusively be applied to the task that has be...
Lutz Frommberger
ECML
2003
Springer
15 years 2 months ago
Could Active Perception Aid Navigation of Partially Observable Grid Worlds?
Due to the unavoidable fact that a robot’s sensors will be limited in some manner, it is entirely possible that it can find itself unable to distinguish between differing state...
Paul A. Crook, Gillian Hayes