Search Sciweavers | Sciweavers

169

Voted

CSL
2012
Springer

311views Automated Reasoning» more CSL 2012»

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

13 years 8 months ago

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurcícek, Blaise Thomson, Steve Young

claim paper

Read More »

103

Voted

ATAL
2009
Springer

150views Intelligent Agents» more ATAL 2009»

Learning of coordination: exploiting sparse interactions in multiagent systems

15 years 7 months ago

Download www.cs.cmu.edu

Creating coordinated multiagent policies in environments with uncertainty is a challenging problem, which can be greatly simpliﬁed if the coordination needs are known to be limi...

Francisco S. Melo, Manuela M. Veloso

claim paper

Read More »

102

click to vote

ICRA
2009
IEEE

143views Robotics» more ICRA 2009»

Least absolute policy iteration for robust value function approximation

15 years 7 months ago

Download sugiyama-www.cs.titech.ac.jp

Abstract— Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efﬁciency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...

claim paper

Read More »

120

Voted

COLING
2000

194views Computational Linguistics» more COLING 2000»

Automatic Optimization of Dialogue Management

15 years 2 months ago

Download www.cis.upenn.edu

Designing the dialogue strategy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing di...

Diane J. Litman, Michael S. Kearns, Satinder P. Si...

claim paper

Read More »

100

click to vote

ICML
2005
IEEE

201views Machine Learning» more ICML 2005»

Interactive learning of mappings from visual percepts to actions

16 years 1 months ago

Download www.machinelearning.org

We introduce flexible algorithms that can automatically learn mappings from images to actions by interacting with their environment. They work by introducing an image classifier i...

Justus H. Piater, Sébastien Jodogne

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers