Search Sciweavers | Sciweavers

75 search results - page 1 / 15

» Reinforcement Learning for MDPs with Constraints

click to vote

ECML
2006
Springer

88views Machine Learning» more ECML 2006»

Reinforcement Learning for MDPs with Constraints

13 years 6 months ago

Download www.peter-geibel.de

In this article, I will consider Markov Decision Processes with two criteria, each defined as the expected value of an infinite horizon cumulative return. The second criterion is e...

Peter Geibel

claim paper

Read More »

click to vote

SAB
2010
Springer

189views Optimization» more SAB 2010»

TeXDYNA: Hierarchical Reinforcement Learning in Factored MDPs

13 years 2 months ago

Download www.isir.upmc.fr

Reinforcement learning is one of the main adaptive mechanisms that is both well documented in animal behaviour and giving rise to computational studies in animats and robots. In th...

Olga Kozlova, Olivier Sigaud, Christophe Meyer

claim paper

Read More »

click to vote

AI
1999
Springer

110views Artificial Intelligence» more AI 1999»

Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning

13 years 4 months ago

Download webdocs.cs.ualberta.ca

Richard S. Sutton, Doina Precup, Satinder P. Singh

claim paper

Read More »

click to vote

NIPS
2007

149views Information Technology» more NIPS 2007»

Online Linear Regression and Its Application to Model-Based Reinforcement Learning

13 years 6 months ago

Download books.nips.cc

We provide a provably efﬁcient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Speciﬁcally, we take a mo...

Alexander L. Strehl, Michael L. Littman

claim paper

Read More »

click to vote

ICML
2002
IEEE

155views Machine Learning» more ICML 2002»

Discovering Hierarchy in Reinforcement Learning with HEXQ

14 years 5 months ago

Download www.cs.berkeley.edu

An open problem in reinforcement learning is discovering hierarchical structure. HEXQ, an algorithm which automatically attempts to decompose and solve a model-free factored MDP h...

Bernhard Hengst

claim paper

Read More »

« Prev « First page 1 / 15 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers