Search Sciweavers | Sciweavers

575 search results - page 70 / 115

» Reinforcement Learning State Estimator

click to vote

NIPS
2001

192views Information Technology» more NIPS 2001»

Predictive Representations of State

14 years 11 months ago

Download www.eecs.umich.edu

We show that states of a dynamical system can be usefully represented by multi-step, action-conditional predictions of future observations. State representations that are grounded...

Michael L. Littman, Richard S. Sutton, Satinder P....

claim paper

Read More »

102

click to vote

CI
2005

106views more CI 2005»

Incremental Learning of Procedural Planning Knowledge in Challenging Environments

14 years 10 months ago

Download www.sunnyhome.org

Autonomous agents that learn about their environment can be divided into two broad classes. One class of existing learners, reinforcement learners, typically employ weak learning ...

Douglas J. Pearson, John E. Laird

claim paper

Read More »

click to vote

AAAI
2010

174views Intelligent Agents» more AAAI 2010»

To Max or Not to Max: Online Learning for Speeding Up Optimal Planning

14 years 11 months ago

Download www.technion.ac.il

It is well known that there cannot be a single "best" heuristic for optimal planning in general. One way of overcoming this is by combining admissible heuristics (e.g. b...

Carmel Domshlak, Erez Karpas, Shaul Markovitch

claim paper

Read More »

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

14 years 4 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

click to vote

INTERSPEECH
2010

108views Signal Processing» more INTERSPEECH 2010»

Incremental word learning using large-margin discriminative training and variance floor estimation

14 years 4 months ago

Download aiweb.techfak.uni-bielefeld.de

We investigate incremental word learning in a Hidden Markov Model (HMM) framework suitable for human-robot interaction. In interactive learning, the tutoring time is a crucial fac...

Irene Ayllón Clemente, Martin Heckmann, Ale...

claim paper

Read More »

« Prev « First page 70 / 115 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers