Search Sciweavers | Sciweavers

95 search results - page 2 / 19

» Policy Gradients for Cryptanalysis

click to vote

UAI
2008

234views Artificial Intelligence» more UAI 2008»

Improving Gradient Estimation by Incorporating Sensor Data

13 years 6 months ago

Download www.cs.berkeley.edu

An efficient policy search algorithm should estimate the local gradient of the objective function, with respect to the policy parameters, from as few trials as possible. Whereas m...

Gregory Lawrence, Stuart J. Russell

claim paper

Read More »

click to vote

SIAMCO
2008

112views more SIAMCO 2008»

A Knowledge-Gradient Policy for Sequential Information Collection

13 years 5 months ago

Download www.castlelab.princeton.edu

In a sequential Bayesian ranking and selection problem with independent normal populations and common known variance, we study a previously introduced measurement policy which we ...

Peter Frazier, Warren B. Powell, Savas Dayanik

claim paper

Read More »

click to vote

ICML
2009
IEEE

148views Machine Learning» more ICML 2009»

Predictive representations for policy gradient in POMDPs

14 years 6 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

click to vote

ECML
2005
Springer

193views Machine Learning» more ECML 2005»

Natural Actor-Critic

13 years 10 months ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

claim paper

Read More »

click to vote

ICML
2003
IEEE

117views Machine Learning» more ICML 2003»

Model-based Policy Gradient Reinforcement Learning

14 years 6 months ago

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers