Search Sciweavers | Sciweavers

437 search results - page 4 / 88

» Policy Gradient Critics

141

click to vote

DATE
2008
IEEE

111views Hardware» more DATE 2008»

Incremental Criticality and Yield Gradients

15 years 12 months ago

Download www.date-conference.com

— Criticality and yield gradients are two crucial diagnostic metrics obtained from Statistical Static Timing Analysis (SSTA). They provide valuable information to guide timing op...

Jinjun Xiong, Vladimir Zolotov, Chandu Visweswaria...

claim paper

Read More »

170

click to vote

UAI
2008

234views Artificial Intelligence» more UAI 2008»

Improving Gradient Estimation by Incorporating Sensor Data

15 years 7 months ago

Download www.cs.berkeley.edu

An efficient policy search algorithm should estimate the local gradient of the objective function, with respect to the policy parameters, from as few trials as possible. Whereas m...

Gregory Lawrence, Stuart J. Russell

claim paper

Read More »

160

click to vote

SIAMCO
2008

112views more SIAMCO 2008»

A Knowledge-Gradient Policy for Sequential Information Collection

15 years 5 months ago

Download www.castlelab.princeton.edu

In a sequential Bayesian ranking and selection problem with independent normal populations and common known variance, we study a previously introduced measurement policy which we ...

Peter Frazier, Warren B. Powell, Savas Dayanik

claim paper

Read More »

122

click to vote

ICML
2003
IEEE

117views Machine Learning» more ICML 2003»

Model-based Policy Gradient Reinforcement Learning

16 years 6 months ago

Download www.aaai.org

Xin Wang, Thomas G. Dietterich

claim paper

Read More »

163

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 6 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

« Prev « First page 4 / 88 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers