Sciweavers

1434 search results - page 247 / 287
» Stochastic computation
Sort
View
NIPS
2001
14 years 11 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
NIPS
2001
14 years 11 months ago
Online Learning with Kernels
Abstract--Kernel-based algorithms such as support vector machines have achieved considerable success in various problems in batch setting, where all of the training data is availab...
Jyrki Kivinen, Alex J. Smola, Robert C. Williamson
UAI
2004
14 years 11 months ago
Solving Factored MDPs with Continuous and Discrete Variables
Although many real-world stochastic planning problems are more naturally formulated by hybrid models with both discrete and continuous variables, current state-of-the-art methods ...
Carlos Guestrin, Milos Hauskrecht, Branislav Kveto...
WSC
2001
14 years 11 months ago
Quantile and histogram estimation
This paper discusses implementation of a sequential procedure to construct proportional half-width confidence intervals for a simulation estimator of the steady-state quantiles an...
E. Jack Chen, W. David Kelton
WSC
2004
14 years 11 months ago
Simulation-Based Optimization for Material Dispatching in a Retailer Network
This paper presents preliminary work done on simulationbased optimization of a stochastic material-dispatching system in a retailer network. The problem we consider is one of dete...
Ganesh Subramaniam, Abhijit Gosavi