Search Sciweavers | Sciweavers

181 search results - page 3 / 37

» State Space Reduction For Hierarchical Reinforcement Learnin...

PRICAI
2000
Springer

193 views, Artificial Intelligence

Generating Hierarchical Structure in Reinforcement Learning from State Variables

13 years 8 months ago

Download www.csee.umbc.edu

This paper presents the CQ algorithm which decomposes and solves a Markov Decision Process (MDP) by automatically generating a hierarchy of smaller MDPs using state variables. The ...

Bernhard Hengst

AUSAI
2007
Springer

97 views, Artificial Intelligence

Safe State Abstraction and Reusable Continuing Subtasks in Hierarchical Reinforcement Learning

13 years 11 months ago

Download www.cse.unsw.edu.au

Bernhard Hengst

IROS
2007
IEEE

157 views, Robotics

Autonomous blimp control using model-free reinforcement learning in a continuous state and action space

13 years 11 months ago

Download www.informatik.uni-freiburg.de

In this paper, we present an approach that applies the reinforcement learning principle to the problem of learning height control policies for aerial blimps. In contrast to pre...

Axel Rottmann, Christian Plagemann, Peter Hilgers,...

ICMLA
2004

109 views, Machine Learning

Variable resolution discretization in the joint space

13 years 6 months ago

Download highentropy.com

We present JoSTLe, an algorithm that performs value iteration on control problems with continuous actions, allowing this useful reinforcement learning technique to be applied to p...

Christopher K. Monson, David Wingate, Kevin D. Sep...

AAAI
1998

150 views, Intelligent Agents

Tree Based Discretization for Continuous State Space Reinforcement Learning

13 years 6 months ago

Download www.cs.cmu.edu

Reinforcement learning is an effective technique for learning action policies in discrete stochastic environments, but its efficiency can decay exponentially with the size of the ...

William T. B. Uther, Manuela M. Veloso
