Search Sciweavers | Sciweavers

14 search results - page 1 / 3

» Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

click to vote

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

13 years 5 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

click to vote

IJCAI
2003

169views Artificial Intelligence» more IJCAI 2003»

Covariant Policy Search

13 years 5 months ago

Download www.ri.cmu.edu

We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...

J. Andrew Bagnell, Jeff G. Schneider

claim paper

Read More »

click to vote

ORL
2007

112views more ORL 2007»

Competitive analysis of a dispatch policy for a dynamic multi-period routing problem

13 years 4 months ago

Download www2.isye.gatech.edu

We analyze a simple and natural on-line algorithm (dispatch policy) for a dynamic multiperiod uncapacitated routing problem, in which at the beginning of each time period a set of...

Enrico Angelelli, Martin W. P. Savelsbergh, Maria ...

claim paper

Read More »

click to vote

ICANNGA
2007
Springer

105views Algorithms» more ICANNGA 2007»

Reinforcement Learning in Fine Time Discretization

13 years 10 months ago

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...

Pawel Wawrzynski

claim paper

Read More »

click to vote

COMPUTING
2004

204views more COMPUTING 2004»

Image Registration by a Regularized Gradient Flow. A Streaming Implementation in DX9 Graphics Hardware

13 years 4 months ago

Download www.mpi-inf.mpg.de

The presented image registration method uses a regularized gradient flow to correlate the intensities in two images. Thereby, an energy functional is successively minimized by des...

Robert Strzodka, Marc Droske, Martin Rumpf

claim paper

Read More »

« Prev « First page 1 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers