Search Sciweavers | Sciweavers

1233 search results - page 199 / 247

» Reinforcement Learning in MirrorBot


HYBRID
2005
Springer

Control Systems

Learning Multi-modal Control Programs


Abstract. Multi-modal control is a commonly used design tool for breaking up complex control tasks into sequences of simpler tasks. In this paper, we show that by viewing the contr...

Tejas R. Mehta, Magnus Egerstedt


CSL
2010
Springer

Automated Reasoning

Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems


This paper describes a statistically motivated framework for performing real-time dialogue state updates and policy learning in a spoken dialogue system. The framework is based on...

Blaise Thomson, Steve Young


IJCNN
2008
IEEE

Neural Networks

Learning to select relevant perspective in a dynamic environment


When an agent observes its environment, there are two important characteristics of the perceived information. One is the relevance of information and the other is redundancy. T...

Zhihui Luo, David A. Bell, Barry McCollum, Qingxia...


ICML
2000
IEEE

Machine Learning

Eligibility Traces for Off-Policy Policy Evaluation


Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh


ECML
2004
Springer

Machine Learning

Experiments in Value Function Approximation with Sparse Support Vector Regression


Abstract. We present first experiments using Support Vector Regression as function approximator for an on-line, sarsa-like reinforcement learner. To overcome the batch nature of S...

Tobias Jung, Thomas Uthmann
