Search Sciweavers | Sciweavers

1455 search results - page 60 / 291

» Exploiting Myopic Learning

ICML
2006
IEEE

136 views, Machine Learning

An analytic solution to discrete Bayesian reinforcement learning

16 years 3 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

OTM
2004
Springer

134 views, Internet Technology

Domain Ontology as a Resource Providing Adaptivity in eLearning

15 years 8 months ago

Download lml.bas.bg

Abstract. This paper presents a knowledge-based approach to eLearning, where the domain ontology plays central role as a resource structuring the learning content and supporting ...

Galia Angelova, Ognian Kalaydjiev, Albena Strupcha...

GECCO
2009
Springer

124 views, Optimization

Reinforcement learning for games: failures and successes

15 years 7 months ago

Download www.gm.fh-koeln.de

We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...

Wolfgang Konen, Thomas Bartz-Beielstein

ICML
1996
IEEE

182 views, Machine Learning

Discovering Structure in Multiple Learning Tasks: The TC Algorithm

15 years 7 months ago

Download www.ri.cmu.edu

Recently, there has been an increased interest in "lifelong" machine learning methods that transfer knowledge across multiple learning tasks. Such methods have repeated...

Sebastian Thrun, Joseph O'Sullivan

ECML
2006
Springer

141 views, Machine Learning

Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks

15 years 6 months ago

Download www.montefiore.ulg.ac.be

Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...

Sébastien Jodogne, Cyril Briquet, Justus H....
