Search Sciweavers | Sciweavers

1237 search results - page 203 / 248

» Simulation sampling with live-points

179

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

225

click to vote

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

15 years 8 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

162

click to vote

GRAPHICSINTERFACE
2000

119views Computer Graphics» more GRAPHICSINTERFACE 2000»

Adaptive Representation of Specular Light Flux

15 years 8 months ago

Download www.iro.umontreal.ca

Caustics produce beautiful and intriguing illumination patterns. However, their complex behavior make them difficult to simulate accurately in all but the simplest configurations....

Normand Brière, Pierre Poulin

claim paper

Read More »

172

click to vote

AAAI
1996

121views Intelligent Agents» more AAAI 1996»

A Clinician's Tool for Analyzing Non-Compliance

15 years 8 months ago

Download research.microsoft.com

We describe a computer program to assist a clinician with assessing the e cacy of treatments in experimental studies for which treatment assignment is random but subject complianc...

David Maxwell Chickering, Judea Pearl

claim paper

Read More »

201

click to vote

WCE
2007

204views Electrical And Computer Engi...» more WCE 2007»

Bootstrap Confidence Interval for the Median Failure Time of Three-Parameter Weibull Distribution

15 years 8 months ago

Download www.iaeng.org

— In many applications of failure time data analysis, it is important to perform inferences about the median of the distribution function in situations of failure time data model...

N. A. Ibrahim, A. Kudus

claim paper

Read More »

« Prev « First page 203 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers