Search Sciweavers | Sciweavers

51 search results - page 2 / 11

» Exponentiated Gradient Methods for Reinforcement Learning

click to vote

JMLR
2008

230views more JMLR 2008»

Exponentiated Gradient Algorithms for Conditional Random Fields and Max-Margin Markov Networks

13 years 6 months ago

Download www.stat.berkeley.edu

Log-linear and maximum-margin models are two commonly-used methods in supervised machine learning, and are frequently used in structured prediction problems. Efficient learning of...

Michael Collins, Amir Globerson, Terry Koo, Xavier...

claim paper

Read More »

click to vote

ESANN
2008

115views Neural Networks» more ESANN 2008»

13 years 7 months ago

Similarities and differences between policy gradient methods and evolution strategies

Download www.dice.ucl.ac.be

Natural policy gradient methods and the covariance matrix adaptation evolution strategy, two variable metric methods proposed for solving reinforcement learning tasks, are contrast...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

click to vote

IROS
2006
IEEE

113views Robotics» more IROS 2006»

Policy Gradient Methods for Robotics

14 years 7 days ago

Download www.cs.utah.edu

— The aquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...

Jan Peters, Stefan Schaal

claim paper

Read More »

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

13 years 7 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

click to vote

ICANN
2005
Springer

151views Neural Networks» more ICANN 2005»

Reinforcement Learning in MirrorBot

13 years 11 months ago

Download fias.uni-frankfurt.de

For this special session of EU projects in the area of NeuroIT, we will review the progress of the MirrorBot project with special emphasis on its relation to reinforcement learning...

Cornelius Weber, David Muse, Mark Elshaw, Stefan W...

claim paper

Read More »

« Prev « First page 2 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers