Search Sciweavers | Sciweavers

Abstract. In this paper, we present the results of a pilot study of a human-robot interaction experiment where the rhythm of the interaction is used as a reinforcement signal to le...

Antoine Hiolle, Lola Cañamero, Pierre Andry...

ICRA
2009
IEEE

Robotics

Least absolute policy iteration for robust value function approximation

15 years 10 months ago

Download sugiyama-www.cs.titech.ac.jp

Abstract— Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...

COLING
2000

Computational Linguistics

Automatic Optimization of Dialogue Management

15 years 5 months ago

Download www.cis.upenn.edu

Designing the dialogue strategy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing di...

Diane J. Litman, Michael S. Kearns, Satinder P. Si...
