Search Sciweavers | Sciweavers

60 search results - page 3 / 12

» Iteratively Extending Time Horizon Reinforcement Learning

click to vote

ICRA
2009
IEEE

227views Robotics» more ICRA 2009»

Adaptive autonomous control using online value iteration with gaussian processes

14 years 23 days ago

Download www-personal.acfr.usyd.edu.au

— In this paper, we present a novel approach to controlling a robotic system online from scratch based on the reinforcement learning principle. In contrast to other approaches, o...

Axel Rottmann, Wolfram Burgard

claim paper

Read More »

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

13 years 4 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

click to vote

ECML
2004
Springer

157views Machine Learning» more ECML 2004»

Model Approximation for HEXQ Hierarchical Reinforcement Learning

13 years 11 months ago

Download www.cse.unsw.edu.au

HEXQ is a reinforcement learning algorithm that discovers hierarchical structure automatically. The generated task hierarchy repthe problem at diﬀerent levels of abstraction. In ...

Bernhard Hengst

claim paper

Read More »

click to vote

FLAIRS
1998

132views Artificial Intelligence» more FLAIRS 1998»

Analytical Design of Reinforcement Learning Tasks

13 years 7 months ago

Download www.aaai.org

Reinforcement learning (RL) problems constitute an important class of learning and control problems faced by artificial intelligence systems. In these problems, one is faced with ...

Robert E. Smith

claim paper

Read More »

click to vote

ICML
2004
IEEE

145views Machine Learning» more ICML 2004»

Convergence of synchronous reinforcement learning with linear function approximation

14 years 7 months ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

claim paper

Read More »

« Prev « First page 3 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers