Nonparametric Model-Based Reinforcement Learning

This paper describes some of the interactions of model learning algorithms and planning algorithms we have found in exploring model-based reinforcement learning. The paper focuses on how local trajectory optimizers can be used effectively with learned nonparametric models. We find that trajectory planners that are fully consistent with the learned model often have difficulty finding reasonable plans in the early stages of learning. Trajectory planners that balance obeying the learned model with minimizing cost (or maximizing reward) often do better, even if the plan is not fully consistent with the learned model.
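
The trade-off described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation. It assumes a one-dimensional state, a quadratic task cost, and a kernel-weighted average as a stand-in nonparametric model. The names `kernel_model`, `plan_objective`, and `consistency_weight` are hypothetical. States and actions are both treated as free plan variables, and disagreement with the learned model is penalized rather than forbidden.

```python
import numpy as np

def kernel_model(x, u, data_x, data_u, data_next, bandwidth=0.5):
    """Hypothetical nonparametric model: kernel-weighted average of
    observed next states near the query point (x, u)."""
    d2 = (data_x - x) ** 2 + (data_u - u) ** 2
    w = np.exp(-d2 / (2.0 * bandwidth ** 2))
    return float(np.sum(w * data_next) / (np.sum(w) + 1e-12))

def plan_objective(xs, us, model, goal, consistency_weight):
    """Return (total, task_cost, violation) for a candidate plan.

    xs has one more entry than us. The violation term measures how far
    each planned transition departs from the model's prediction.
    """
    task_cost = sum((x - goal) ** 2 for x in xs[1:]) + 0.01 * sum(u ** 2 for u in us)
    violation = sum((xs[t + 1] - model(xs[t], us[t])) ** 2 for t in range(len(us)))
    return task_cost + consistency_weight * violation, task_cost, violation

# Assumed toy model for the comparison below: next state = x + u.
toy_model = lambda x, u: x + u

# Plan A obeys the model exactly; plan B reaches the goal sooner
# but its first transition disagrees with the model.
plan_a = plan_objective([0.0, 1.0, 2.0], [1.0, 1.0], toy_model, goal=2.0, consistency_weight=0.5)
plan_b = plan_objective([0.0, 2.0, 2.0], [1.0, 0.0], toy_model, goal=2.0, consistency_weight=0.5)
```

With a small `consistency_weight`, the inconsistent plan B scores lower than the fully consistent plan A. Raising the weight pushes the planner back toward plans that obey the model, which is the balance the abstract refers to.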
Christopher G. Atkeson
Added 01 Nov 2010
Updated 01 Nov 2010
Type Conference
Year 1997
Where NIPS
Authors Christopher G. Atkeson