Search Sciweavers | Sciweavers

664 search results - page 29 / 133

» Combining Reinforcement Learning with a Local Control Algori...

165

click to vote

NIPS
1998

137views Information Technology» more NIPS 1998»

Risk Sensitive Reinforcement Learning

15 years 7 months ago

Download www.cs.cmu.edu

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...

Ralph Neuneier, Oliver Mihatsch

claim paper

Read More »

138

click to vote

ICONIP
2009

107views Information Technology» more ICONIP 2009»

Tracking in Reinforcement Learning

15 years 3 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

claim paper

Read More »

169

click to vote

AAAI
2010

173views Intelligent Agents» more AAAI 2010»

Integrating Sample-Based Planning and Model-Based Reinforcement Learning

15 years 7 months ago

Download paul.rutgers.edu

Recent advancements in model-based reinforcement learning have shown that the dynamics of many structured domains (e.g. DBNs) can be learned with tractable sample complexity, desp...

Thomas J. Walsh, Sergiu Goschin, Michael L. Littma...

claim paper

Read More »

154

click to vote

CORR
2006
Springer

101views Education» more CORR 2006»

Metric State Space Reinforcement Learning for a Vision-Capable Mobile Robot

15 years 6 months ago

Download www.idsia.ch

We address the problem of autonomously learning controllers for visioncapable mobile robots. We extend McCallum's (1995) Nearest-Sequence Memory algorithm to allow for genera...

Viktor Zhumatiy, Faustino J. Gomez, Marcus Hutter,...

claim paper

Read More »

141

click to vote

CIVR
2007
Springer

98views Image Analysis» more CIVR 2007»

Semantics reinforcement and fusion learning for multimedia streams

16 years 10 days ago

Download wang.ist.psu.edu

Fusion of multimedia streams for enhanced performance is a critical problem for retrieval. However, fusion performance tends to easily overﬁt the hillclimb set used to learn fus...

Dhiraj Joshi, Milind R. Naphade, Apostol Natsev

claim paper

Read More »

« Prev « First page 29 / 133 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers