Search Sciweavers | Sciweavers

2011 search results - page 187 / 403

» Universal Reinforcement Learning

129

Voted

AIIDE
2008

146views Artificial Intelligence» more AIIDE 2008»

Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games

15 years 7 months ago

Download www.aaai.org

We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...

Maria Cutumisu, Duane Szafron, Michael H. Bowling,...

claim paper

Read More »

133

click to vote

ALT
2005
Springer

121views Machine Learning» more ALT 2005»

Monotone Conditional Complexity Bounds on Future Prediction Errors

16 years 1 months ago

Download www.idsia.ch

We bound the future loss when predicting any (computably) stochastic sequence online. Solomonoﬀ ﬁnitely bounded the total deviation of his universal predictor M from the true ...

Alexey V. Chernov, Marcus Hutter

claim paper

Read More »

111

click to vote

BMCBI
2007

105views more BMCBI 2007»

Constrained hidden Markov models for population-based haplotyping

15 years 5 months ago

Download www.cs.helsinki.fi

abstract Niels Landwehr1 , Taneli Mielik¨ainen2 , Lauri Eronen2 , Hannu Toivonen1,2 , and Heikki Mannila2 1 Machine Learning Lab, Dept. of Comp. Science, University of Freiburg, G...

Niels Landwehr, Taneli Mielikäinen, Lauri Ero...

claim paper

Read More »

169

click to vote

ICML
2000
IEEE

153views Machine Learning» more ICML 2000»

Eligibility Traces for Off-Policy Policy Evaluation

16 years 5 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

148

Voted

ECML
2004
Springer

154views Machine Learning» more ECML 2004»

Experiments in Value Function Approximation with Sparse Support Vector Regression

15 years 10 months ago

Download userweb.cs.utexas.edu

Abstract. We present ﬁrst experiments using Support Vector Regression as function approximator for an on-line, sarsa-like reinforcement learner. To overcome the batch nature of S...

Tobias Jung, Thomas Uthmann

claim paper

Read More »

« Prev « First page 187 / 403 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers