Search Sciweavers | Sciweavers

1434 search results - page 247 / 287

» Stochastic computation

NIPS
2001

Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

NIPS
2001

Information Technology

Online Learning with Kernels

Kernel-based algorithms such as support vector machines have achieved considerable success in various problems in the batch setting, where all of the training data is availab...

Jyrki Kivinen, Alex J. Smola, Robert C. Williamson

UAI
2004

Artificial Intelligence

Solving Factored MDPs with Continuous and Discrete Variables

Although many real-world stochastic planning problems are more naturally formulated by hybrid models with both discrete and continuous variables, current state-of-the-art methods ...

Carlos Guestrin, Milos Hauskrecht, Branislav Kveto...

WSC
2001

Modeling And Simulation

Quantile and histogram estimation

This paper discusses implementation of a sequential procedure to construct proportional half-width confidence intervals for a simulation estimator of the steady-state quantiles an...

E. Jack Chen, W. David Kelton

WSC
2004

Modeling And Simulation

Simulation-Based Optimization for Material Dispatching in a Retailer Network

This paper presents preliminary work done on simulation-based optimization of a stochastic material-dispatching system in a retailer network. The problem we consider is one of dete...

Ganesh Subramaniam, Abhijit Gosavi
