Search Sciweavers | Sciweavers

813 search results - page 137 / 163

» Ensemble Algorithms in Reinforcement Learning

ICML
2009
IEEE

155 views, Machine Learning

Near-Bayesian exploration in polynomial time

16 years 4 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng

CPAIOR
2010
Springer

141 views, Operations Research

Strong Combination of Ant Colony Optimization with Constraint Programming Optimization

15 years 9 months ago

Download liris.cnrs.fr

We introduce an approach which combines ACO (Ant Colony Optimization) and IBM ILOG CP Optimizer for solving COPs (Combinatorial Optimization Problems). The problem is modeled using...

Madjid Khichane, Patrick Albert, Christine Solnon

NIPS
2007

146 views, Information Technology

Receding Horizon Differential Dynamic Programming

15 years 5 months ago

Download books.nips.cc

The control of high-dimensional, continuous, non-linear dynamical systems is a key problem in reinforcement learning and control. Local, trajectory-based methods, using techniques...

Yuval Tassa, Tom Erez, William D. Smart

ICML
1999
IEEE

168 views, Machine Learning

Least-Squares Temporal Difference Learning

16 years 4 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

ICML
2008
IEEE

117 views, Machine Learning

Sample-based learning and search with permanent and transient memories

16 years 4 months ago

Download www.cs.ualberta.ca

We present a reinforcement learning architecture, Dyna-2, that encompasses both sample-based learning and sample-based search, and that generalises across states during both learni...

David Silver, Martin Müller, Richard S. ...
