Sciweavers

2108 search results - page 140 / 422
» Tracking in Reinforcement Learning
Sort
View
89
Voted
ICML
1995
IEEE
16 years 2 months ago
Tracking the Best Expert
Mark Herbster, Manfred K. Warmuth
COLT
2006
Springer
15 years 5 months ago
Tracking the Best Hyperplane with a Simple Budget Perceptron
Nicolò Cesa-Bianchi, Claudio Gentile
SOCROB
2010
126views Robotics» more  SOCROB 2010»
15 years 7 days ago
Using the Interaction Rhythm as a Natural Reinforcement Signal for Social Robots: A Matter of Belief
Abstract. In this paper, we present the results of a pilot study of a human robot interaction experiment where the rhythm of the interaction is used as a reinforcement signal to le...
Antoine Hiolle, Lola Cañamero, Pierre Andry...
ICRA
2009
IEEE
143views Robotics» more  ICRA 2009»
15 years 8 months ago
Least absolute policy iteration for robust value function approximation
Abstract— Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...
Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...
COLING
2000
15 years 3 months ago
Automatic Optimization of Dialogue Management
Designing the dialogue strategy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing di...
Diane J. Litman, Michael S. Kearns, Satinder P. Si...