Sciweavers

575 search results - page 102 / 115
» Reinforcement Learning State Estimator
Sort
View
ICML
2009
IEEE
15 years 10 months ago
Hilbert space embeddings of conditional distributions with applications to dynamical systems
In this paper, we extend the Hilbert space embedding approach to handle conditional distributions. We derive a kernel estimate for the conditional embedding, and show its connecti...
Le Song, Jonathan Huang, Alexander J. Smola, Kenji...
ICML
2005
IEEE
15 years 10 months ago
Finite time bounds for sampling based fitted value iteration
In this paper we consider sampling based fitted value iteration for discounted, large (possibly infinite) state space, finite action Markovian Decision Problems where only a gener...
Csaba Szepesvári, Rémi Munos
AI
2006
Springer
14 years 10 months ago
Robot introspection through learned hidden Markov models
In this paper we describe a machine learning approach for acquiring a model of a robot behaviour from raw sensor data. We are interested in automating the acquisition of behaviour...
Maria Fox, Malik Ghallab, Guillaume Infantes, Dere...
ICFP
2008
ACM
15 years 9 months ago
Write it recursively: a generic framework for optimal path queries
Optimal path queries are queries to obtain an optimal path specified by a given criterion of optimality. There have been many studies to give efficient algorithms for classes of o...
Akimasa Morihata, Kiminori Matsuzaki, Masato Takei...
IADIS
2003
14 years 11 months ago
Adaptive Web Service for QOS Improvement
In this paper we investigate how “self-awareness'', through on-line self-monitoring and measurement, coupled with intelligent adaptive behaviour in response to observe...
Erol Gelenbe, Arturo Núñez