Sciweavers

9521 search results - page 396 / 1905
» Compiling with continuations, continued
Sort
View
ICML
2006
IEEE
16 years 5 months ago
PAC model-free reinforcement learning
For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...
Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...
ICML
2005
IEEE
16 years 5 months ago
Recognition and reproduction of gestures using a probabilistic framework combining PCA, ICA and HMM
This paper explores the issue of recognizing, generalizing and reproducing arbitrary gestures. We aim at extracting a representation that encapsulates only the key aspects of the ...
Sylvain Calinon, Aude Billard
ICML
2003
IEEE
16 years 5 months ago
Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning
We present a novel Bayesian approach to the problem of value function estimation in continuous state spaces. We define a probabilistic generative model for the value function by i...
Yaakov Engel, Shie Mannor, Ron Meir
CHI
2008
ACM
16 years 4 months ago
Intelligent object group selection
Current object group selection techniques such as lasso or rectangle selection can be time consuming and error prone. This is apparent when selecting distant objects on a large di...
Hoda Dehmeshki, Wolfgang Stürzlinger
CHI
2003
ACM
16 years 4 months ago
The evolution of buildings and implications for the design of ubiquitous domestic environments
This paper considers how we may realize future ubiquitous domestic environments. Building upon previous work on how buildings evolve by Stewart Brand, we suggest the need to broad...
Tom Rodden, Steve Benford