Sciweavers

38 search results - page 6 / 8
» Diagnostics for functional regression via residual processes
AAAI
2006
13 years 7 months ago
Incremental Least Squares Policy Iteration for POMDPs
We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...
Hui Li, Xuejun Liao, Lawrence Carin
ICRA
2008
IEEE
14 years 21 days ago
Sparse incremental learning for interactive robot control policy estimation
We are interested in transferring control policies for arbitrary tasks from a human to a robot. Using interactive demonstration via teleoperation as our transfer scenario, we ca...
Daniel H. Grollman, Odest Chadwicke Jenkins
PKDD
2010
Springer
13 years 4 months ago
Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration
We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...
Tobias Jung, Peter Stone
COMCOM
2006
13 years 6 months ago
E-Span and LPT for data aggregation in wireless sensor networks
In wireless sensor networks (WSNs), when a stimulus or event is detected within a particular region, data reports from the neighboring sensor nodes (sources) are sent to the sink ...
Weinan Marc Lee, Vincent W. S. Wong
ICASSP
2007
IEEE
13 years 8 months ago
Variable Regularized Fast Affine Projections
This paper introduces a variable regularization method for the fast affine projection algorithm (VR-FAP). It is inspired by a recently introduced technique for variable regulariza...
Deepak Challa, Steven L. Grant, Asif Iqbal Mohamma...