Sciweavers

2011 search results - page 143 / 403
» Universal Reinforcement Learning
AAAI
1996
15 years 5 months ago
Evolution-Based Discovery of Hierarchical Behaviors
Procedural representations of control policies have two advantages when facing the scale-up problem in learning tasks. First they are implicit, with potential for inductive genera...
Justinian P. Rosca, Dana H. Ballard
ICDS
2009
IEEE
15 years 11 months ago
Anticipating the Digital University
The University of the Digital Society is based on interactions that facilitate learning in a new pragmatic context. Catalysts involved in the learning process replace the traditio...
Mihai Nadin
SOCROB
2010
Robotics
15 years 2 months ago
Using the Interaction Rhythm as a Natural Reinforcement Signal for Social Robots: A Matter of Belief
In this paper, we present the results of a pilot study of a human robot interaction experiment where the rhythm of the interaction is used as a reinforcement signal to le...
Antoine Hiolle, Lola Cañamero, Pierre Andry...
ICRA
2009
IEEE
Robotics
15 years 11 months ago
Least absolute policy iteration for robust value function approximation
Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...
Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...
COLING
2000
15 years 5 months ago
Automatic Optimization of Dialogue Management
Designing the dialogue strategy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing di...
Diane J. Litman, Michael S. Kearns, Satinder P. Si...