Search Sciweavers | Sciweavers

13

NIPS
2001

121views Information Technology» more NIPS 2001»

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

13 years 7 months ago

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

13

click to vote

TSP
2010

111views Artificial Intelligence» more TSP 2010»

Performance of instantaneous frequency rate estimation using high-order phase function

13 years 24 days ago

Download www.ece.stevens-tech.edu

Abstract--The high-order phase function (HPF) is a useful tool to estimate the instantaneous frequency rate (IFR) of a signal with a polynomial phase. In this paper, the asymptotic...

Pu Wang, Hongbin Li, Igor Djurovic, Braham Himed

claim paper

Read More »

14

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

13 years 7 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

11

click to vote

AUSAI
2003
Springer

110views Artificial Intelligence» more AUSAI 2003»

On Why Discretization Works for Naive-Bayes Classifiers

13 years 9 months ago

Download www.cs.iastate.edu

We investigate why discretization is effective in naive-Bayes learning. We prove a theorem that identifies particular conditions under which discretization will result in naiveBay...

Ying Yang, Geoffrey I. Webb

claim paper

Read More »

21

click to vote

PG
2007
IEEE

156views Computer Graphics» more PG 2007»

Statistical Hypothesis Testing for Assessing Monte Carlo Estimators: Applications to Image Synthesis

14 years 12 days ago

Download artis.imag.fr

Image synthesis algorithms are commonly compared on the basis of running times and/or perceived quality of the generated images. In the case of Monte Carlo techniques, assessment ...

Kartic Subr, James Arvo

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers