Sciweavers

1235 search results - page 246 / 247
» Reinforcement learning in a nutshell
Sort
View
DAGSTUHL
2001
13 years 6 months ago
Decision-Theoretic Control of Planetary Rovers
Planetary rovers are small unmanned vehicles equipped with cameras and a variety of sensors used for scientific experiments. They must operate under tight constraints over such res...
Shlomo Zilberstein, Richard Washington, Daniel S. ...
IADIS
2003
13 years 6 months ago
Adaptive Web Service for QOS Improvement
In this paper we investigate how “self-awareness'', through on-line self-monitoring and measurement, coupled with intelligent adaptive behaviour in response to observe...
Erol Gelenbe, Arturo Núñez
IJCAI
2003
13 years 6 months ago
Use of Off-line Dynamic Programming for Efficient Image Interpretation
An interpretation system finds the likely mappings from portions of an image to real-world objects. An interpretation policy specifies when to apply which imaging operator, to whi...
Ramana Isukapalli, Russell Greiner
NIPS
1998
13 years 6 months ago
Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms
In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...
Michael J. Kearns, Satinder P. Singh
BC
2006
124views more  BC 2006»
13 years 5 months ago
Motor-maps, navigation and implicit space representation in the hippocampus
Abstract Multiple sensory-motor maps located in the brainstem and the cortex are involved in spatial orientation. Guiding movements of eyes, head, neck and arms they provide an app...
Alexander Kaske, Gösta Winberg, Joakim Cö...