Sciweavers

43 search results - page 6 / 9
» Training Reinforcement Neurocontrollers Using the Polytope A...
Sort
View
KBS
2006
105views more  KBS 2006»
14 years 9 months ago
Robot docking based on omnidirectional vision and reinforcement learning
We present a system for visual robotic docking using an omnidirectional camera coupled with the actor critic reinforcement learning algorithm. The system enables a PeopleBot robot...
David Muse, Cornelius Weber, Stefan Wermter
AGENTS
1999
Springer
15 years 2 months ago
Team-Partitioned, Opaque-Transition Reinforcement Learning
In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of usin...
Peter Stone, Manuela M. Veloso
GECCO
2009
Springer
124views Optimization» more  GECCO 2009»
15 years 2 months ago
Reinforcement learning for games: failures and successes
We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...
Wolfgang Konen, Thomas Bartz-Beielstein
EUROCAST
2007
Springer
182views Hardware» more  EUROCAST 2007»
15 years 4 months ago
A k-NN Based Perception Scheme for Reinforcement Learning
Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...
José Antonio Martin H., Javier de Lope Asia...
ESANN
2003
14 years 11 months ago
Improving iterative repair strategies for scheduling with the SVM
The resource constraint project scheduling problem (RCPSP) is an NP-hard benchmark problem in scheduling which takes into account the limitation of resources’ availabilities in ...
Kai Gersmann, Barbara Hammer