Sciweavers

437 search results - page 4 / 88
» Policy Gradient Critics
DATE
2008
IEEE
14 years 21 days ago
Incremental Criticality and Yield Gradients
Criticality and yield gradients are two crucial diagnostic metrics obtained from Statistical Static Timing Analysis (SSTA). They provide valuable information to guide timing op...
Jinjun Xiong, Vladimir Zolotov, Chandu Visweswaria...
UAI
2008
13 years 7 months ago
Improving Gradient Estimation by Incorporating Sensor Data
An efficient policy search algorithm should estimate the local gradient of the objective function, with respect to the policy parameters, from as few trials as possible. Whereas m...
Gregory Lawrence, Stuart J. Russell
SIAMCO
2008
13 years 6 months ago
A Knowledge-Gradient Policy for Sequential Information Collection
In a sequential Bayesian ranking and selection problem with independent normal populations and common known variance, we study a previously introduced measurement policy which we ...
Peter Frazier, Warren B. Powell, Savas Dayanik
ICML
2003
IEEE
14 years 7 months ago
Model-based Policy Gradient Reinforcement Learning
Xin Wang, Thomas G. Dietterich
ICML
2001
IEEE
14 years 7 months ago
Off-Policy Temporal Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta