Sciweavers

RSS 2007 (Robotics)
13 years 6 months ago
Active Policy Learning for Robot Planning and Exploration under Uncertainty
Abstract— This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partially-observed sequential decision processes. The algorithm is tested i...
Ruben Martinez-Cantin, Nando de Freitas, Arnaud Do...
CORR 2011, Springer (Education)
12 years 11 months ago
Computational Rationalization: The Inverse Equilibrium Problem
Modeling the behavior of imperfect agents from a small number of observations is a difficult but important task. In the single-agent decision-theoretic setting, inverse optimal co...
Kevin Waugh, Brian Ziebart, J. Andrew Bagnell