Sciweavers

236 search results - page 1 / 48
» Bias and Variance Approximation in Value Function Estimates
Sort
View
MANSCI
2007
67views more  MANSCI 2007»
13 years 5 months ago
Bias and Variance Approximation in Value Function Estimates
Shie Mannor, Duncan Simester, Peng Sun, John N. Ts...
ICML
2004
IEEE
13 years 10 months ago
Bias and variance in value function estimation
Shie Mannor, Duncan Simester, Peng Sun, John N. Ts...
AAAI
2008
13 years 7 months ago
Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation
Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...
Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...
JMLR
2006
143views more  JMLR 2006»
13 years 5 months ago
Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation
We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...
Rémi Munos
NIPS
2001
13 years 6 months ago
Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning
We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...
Gregory Z. Grudic, Lyle H. Ungar