Sciweavers

1455 search results - page 60 / 291
» Exploiting Myopic Learning
Sort
View
ICML
2006
IEEE
16 years 20 days ago
An analytic solution to discrete Bayesian reinforcement learning
Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...
Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...
OTM
2004
Springer
15 years 5 months ago
Domain Ontology as a Resource Providing Adaptivity in eLearning
Abstract. This paper presents a knowledge-based approach to eLearning, where the domain ontology plays central role as a resource structuring the learning content and supporting ï¬...
Galia Angelova, Ognian Kalaydjiev, Albena Strupcha...
GECCO
2009
Springer
124views Optimization» more  GECCO 2009»
15 years 4 months ago
Reinforcement learning for games: failures and successes
We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...
Wolfgang Konen, Thomas Bartz-Beielstein
ICML
1996
IEEE
15 years 4 months ago
Discovering Structure in Multiple Learning Tasks: The TC Algorithm
Recently, there has been an increased interest in "lifelong" machine learning methods, that transfer knowledge across multiple learning tasks. Such methods have repeated...
Sebastian Thrun, Joseph O'Sullivan
ECML
2006
Springer
15 years 3 months ago
Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks
Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...
Sébastien Jodogne, Cyril Briquet, Justus H....