Sciweavers

65 search results - page 2 / 13
» Bias and variance in value function estimation
Sort
View
JMLR
2006
143views more  JMLR 2006»
13 years 5 months ago
Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation
We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...
Rémi Munos
BMCBI
2006
165views more  BMCBI 2006»
13 years 5 months ago
Improved variance estimation of classification performance via reduction of bias caused by small sample size
Background: Supervised learning for classification of cancer employs a set of design examples to learn how to discriminate between tumors. In practice it is crucial to confirm tha...
Ulrika Wickenberg-Bolin, Hanna Göransson, M&a...
COLT
2000
Springer
13 years 9 months ago
Bias-Variance Error Bounds for Temporal Difference Updates
We give the first rigorous upper bounds on the error of temporal difference (td) algorithms for policy evaluation as a function of the amount of experience. These upper bounds pr...
Michael J. Kearns, Satinder P. Singh
WSC
2007
13 years 7 months ago
Replicated batch means for steady-state simulations with initial transients
We provide asymptotic expressions for the expected value and variance of the replicated batch means variance estimator when the stochastic process being simulated has an additive ...
Christos Alexopoulos, Sigrún Andradó...
WSC
2007
13 years 7 months ago
Low bias integrated path estimators
We consider the problem of estimating the time-average variance constant for a stationary process. A previous paper described an approach based on multiple integrations of the sim...
James M. Calvin