Sciweavers

98
Voted
JMLR
2006
124views more  JMLR 2006»
14 years 8 months ago
Policy Gradient in Continuous Time
Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...
Rémi Munos