Sciweavers

51 search results - page 1 / 11
» Exponentiated Gradient Methods for Reinforcement Learning
Sort
View
84
Voted
ICML
1997
IEEE
16 years 1 months ago
Exponentiated Gradient Methods for Reinforcement Learning
Doina Precup, Richard S. Sutton
IJCAI
2001
15 years 2 months ago
Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning
Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...
Gregory Z. Grudic, Lyle H. Ungar
108
Voted
IJCAI
2003
15 years 2 months ago
Covariant Policy Search
We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...
J. Andrew Bagnell, Jeff G. Schneider
115
Voted
ICML
2007
IEEE
16 years 1 months ago
Exponentiated gradient algorithms for log-linear structured prediction
Conditional log-linear models are a commonly used method for structured prediction. Efficient learning of parameters in these models is therefore an important problem. This paper ...
Amir Globerson, Terry Koo, Xavier Carreras, Michae...
103
Voted
NIPS
2001
15 years 2 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...