Sciweavers

53 search results - page 9 / 11
» Shaping multi-agent systems with gradient reinforcement lear...
ICRA
2008
IEEE
15 years 4 months ago
Learning from human teachers with Socially Guided Exploration
We present a learning mechanism, Socially Guided Exploration, in which a robot learns new tasks through a combination of self-exploration and social interaction. The system’s...
Cynthia Breazeal, Andrea Lockerd Thomaz
IBERAMIA
2010
Springer
14 years 8 months ago
Dynamic Reward Shaping: Training a Robot by Voice
Reinforcement learning is commonly used for learning tasks in robotics; however, traditional algorithms can require very long training times. Reward shaping has recently been used to ...
Ana C. Tenorio-Gonzalez, Eduardo F. Morales, Luis ...
IS
2010
14 years 6 months ago
Multicriteria reinforcement learning based on a Russian doll method for network routing
Routing in communication networks is typically a multicriteria decision making (MCDM) problem. However, setting the parameters of the most widely used MCDM methods to fit the preferences ...
Alain Pétrowski, Farouk Aissanou, Ilham Ben...
EUSFLAT
2009
14 years 7 months ago
Incremental Possibilistic Approach for Online Clustering and Classification
In this paper, we propose to extend the supervised classification method Fuzzy Pattern Matching so that it also works as an unsupervised one. The goal is to monitor dynamic systems with...
Moamar Sayed Mouchaweh, Bernard Riera
JMLR
2006
14 years 9 months ago
Policy Gradient in Continuous Time
Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...
Rémi Munos