Sciweavers

15 search results - page 3 / 3
» Incremental Least-Squares Temporal Difference Learning
Sort
View
ICML
2009
IEEE
14 years 6 months ago
Binary action search for learning continuous-action control policies
Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...
Jason Pazis, Michail G. Lagoudakis
NIPS
2007
13 years 7 months ago
Incremental Natural Actor-Critic Algorithms
We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...
Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...
ECAI
2004
Springer
13 years 11 months ago
A Backtracking Strategy for Order-Independent Incremental Learning
Agents that exist in an environment that changes over time, and are able to take into account the temporal nature of experience, are commonly called incremental learners. It is wid...
Nicola Di Mauro, Floriana Esposito, Stefano Ferill...
PAMI
2007
134views more  PAMI 2007»
13 years 5 months ago
Spatio-Temporal Context for Robust Multitarget Tracking
—In multitarget tracking, the main challenge is to maintain the correct identity of targets even under occlusions or when differences between the targets are small. The paper pro...
Hieu Tat Nguyen, Qiang Ji, Arnold W. M. Smeulders
CPAIOR
2006
Springer
13 years 9 months ago
An Efficient Hybrid Strategy for Temporal Planning
Temporal planning (TP) is notoriously difficult because it requires to solve a propositional STRIPS planning problem with temporal constraints. In this paper, we propose an efficie...
Zhao Xing, Yixin Chen, Weixiong Zhang