Search Sciweavers | Sciweavers

15 search results - page 3 / 3

» Incremental Least-Squares Temporal Difference Learning

click to vote

ICML
2009
IEEE

194views Machine Learning» more ICML 2009»

Binary action search for learning continuous-action control policies

14 years 6 months ago

Download www.intelligence.tuc.gr

Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...

Jason Pazis, Michail G. Lagoudakis

claim paper

Read More »

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

13 years 7 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

click to vote

ECAI
2004
Springer

97views Artificial Intelligence» more ECAI 2004»

A Backtracking Strategy for Order-Independent Incremental Learning

13 years 11 months ago

Download www.di.uniba.it

Agents that exist in an environment that changes over time, and are able to take into account the temporal nature of experience, are commonly called incremental learners. It is wid...

Nicola Di Mauro, Floriana Esposito, Stefano Ferill...

claim paper

Read More »

click to vote

PAMI
2007

134views more PAMI 2007»

Spatio-Temporal Context for Robust Multitarget Tracking

13 years 5 months ago

Download www.science.uva.nl

—In multitarget tracking, the main challenge is to maintain the correct identity of targets even under occlusions or when differences between the targets are small. The paper pro...

Hieu Tat Nguyen, Qiang Ji, Arnold W. M. Smeulders

claim paper

Read More »

click to vote

CPAIOR
2006
Springer

125views Operations Research» more CPAIOR 2006»

An Efficient Hybrid Strategy for Temporal Planning

13 years 9 months ago

Download www.cse.wustl.edu

Temporal planning (TP) is notoriously difficult because it requires to solve a propositional STRIPS planning problem with temporal constraints. In this paper, we propose an efficie...

Zhao Xing, Yixin Chen, Weixiong Zhang

claim paper

Read More »

« Prev « First page 3 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers