Search Sciweavers | Sciweavers

813 search results - page 108 / 163

» Ensemble Algorithms in Reinforcement Learning

150

click to vote

ML
2006
ACM

163views Machine Learning» more ML 2006»

Extremely randomized trees

15 years 4 months ago

Download www.montefiore.ulg.ac.be

Abstract This paper proposes a new tree-based ensemble method for supervised classification and regression problems. It essentially consists of randomizing strongly both attribute ...

Pierre Geurts, Damien Ernst, Louis Wehenkel

claim paper

Read More »

165

click to vote

ICML
2000
IEEE

153views Machine Learning» more ICML 2000»

Eligibility Traces for Off-Policy Policy Evaluation

16 years 5 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

150

click to vote

ICML
2004
IEEE

116views Machine Learning» more ICML 2004»

Towards tight bounds for rule learning

15 years 9 months ago

Download wwwkramer.in.tum.de

While there is a lot of empirical evidence showing that traditional rule learning approaches work well in practice, it is nearly impossible to derive analytical results about thei...

Ulrich Rückert, Stefan Kramer

claim paper

Read More »

137

click to vote

NIPS
1996

112views Information Technology» more NIPS 1996»

Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning

15 years 5 months ago

Download www.ri.cmu.edu

Model learning combined with dynamic programming has been shown to be e ective for learning control of continuous state dynamic systems. The simplest method assumes the learned mod...

Jeff G. Schneider

claim paper

Read More »

125

click to vote

AIIDE
2008

146views Artificial Intelligence» more AIIDE 2008»

Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games

15 years 6 months ago

Download www.aaai.org

We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...

Maria Cutumisu, Duane Szafron, Michael H. Bowling,...

claim paper

Read More »

« Prev « First page 108 / 163 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers