Search Sciweavers | Sciweavers

2011 search results - page 143 / 403

» Universal Reinforcement Learning

AAAI
1996

191 views, Intelligent Agents

Evolution-Based Discovery of Hierarchical Behaviors

15 years 5 months ago

Download www.aaai.org

Procedural representations of control policies have two advantages when facing the scale-up problem in learning tasks. First they are implicit, with potential for inductive genera...

Justinian P. Rosca, Dana H. Ballard

ICDS
2009
IEEE

105 views, Theoretical Computer Science

Anticipating the Digital University

15 years 11 months ago

Download www.nadin.ws

The University of the Digital Society is based on interactions that facilitate learning in a new pragmatic context. Catalysts involved in the learning process replace the traditio...

Mihai Nadin

SOCROB
2010

126 views, Robotics

Using the Interaction Rhythm as a Natural Reinforcement Signal for Social Robots: A Matter of Belief

15 years 2 months ago

Download fostsvn.uopnet.plymouth.ac.uk

Abstract. In this paper, we present the results of a pilot study of a human robot interaction experiment where the rhythm of the interaction is used as a reinforcement signal to le...

Antoine Hiolle, Lola Cañamero, Pierre Andry...

ICRA
2009
IEEE

143 views, Robotics

Least absolute policy iteration for robust value function approximation

15 years 11 months ago

Download sugiyama-www.cs.titech.ac.jp

Abstract: Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...

COLING
2000

194 views, Computational Linguistics

Automatic Optimization of Dialogue Management

15 years 5 months ago

Download www.cis.upenn.edu

Designing the dialogue strategy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing di...

Diane J. Litman, Michael S. Kearns, Satinder P. Si...
