Search Sciweavers | Sciweavers

813 search results - page 114 / 163

» Ensemble Algorithms in Reinforcement Learning

149

click to vote

FLAIRS
2008

132views Artificial Intelligence» more FLAIRS 2008»

Learning Continuous Action Models in a Real-Time Strategy Environment

15 years 6 months ago

Download www.knexusresearch.com

Although several researchers have integrated methods for reinforcement learning (RL) with case-based reasoning (CBR) to model continuous action spaces, existing integrations typic...

Matthew Molineaux, David W. Aha, Philip Moore

claim paper

Read More »

144

click to vote

ICML
2009
IEEE

186views Machine Learning» more ICML 2009»

Regularization and feature selection in least-squares temporal difference learning

16 years 5 months ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

120

click to vote

ICML
2001
IEEE

132views Machine Learning» more ICML 2001»

Expectation Maximization for Weakly Labeled Data

16 years 5 months ago

Download characters.media.mit.edu

We call data weakly labeled if it has no exact label but rather a numerical indication of correctness of the label "guessed" by the learning algorithm - a situation comm...

Yuri A. Ivanov, Bruce Blumberg, Alex Pentland

claim paper

Read More »

137

click to vote

MCS
2009
Springer

194views Pattern Recognition» more MCS 2009»

Incremental Learning of Variable Rate Concept Drift

15 years 9 months ago

Download users.rowan.edu

We have recently introduced an incremental learning algorithm, Learn++ .NSE, for Non-Stationary Environments, where the data distribution changes over time due to concept drift. Le...

Ryan Elwell, Robi Polikar

claim paper

Read More »

128

click to vote

NIPS
2003

108views Information Technology» more NIPS 2003»

Policy Search by Dynamic Programming

15 years 5 months ago

Download books.nips.cc

We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...

J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...

claim paper

Read More »

« Prev « First page 114 / 163 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers