Search Sciweavers | Sciweavers

4345 search results - page 64 / 869

» Relational Reinforcement Learning

click to vote

ICML
2004
IEEE

146views Machine Learning» more ICML 2004»

Dynamic abstraction in reinforcement learning via clustering

16 years 4 months ago

Download rlai.cs.ualberta.ca

Abstraction in Reinforcement Learning via Clustering Shie Mannor shie@mit.edu Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA ...

Shie Mannor, Ishai Menache, Amit Hoze, Uri Klein

claim paper

Read More »

108

Voted

ICML
2002
IEEE

155views Machine Learning» more ICML 2002»

Discovering Hierarchy in Reinforcement Learning with HEXQ

16 years 4 months ago

Download www.cs.berkeley.edu

An open problem in reinforcement learning is discovering hierarchical structure. HEXQ, an algorithm which automatically attempts to decompose and solve a model-free factored MDP h...

Bernhard Hengst

claim paper

Read More »

109

click to vote

ECML
2006
Springer

88views Machine Learning» more ECML 2006»

Reinforcement Learning for MDPs with Constraints

15 years 5 months ago

Download www.peter-geibel.de

In this article, I will consider Markov Decision Processes with two criteria, each defined as the expected value of an infinite horizon cumulative return. The second criterion is e...

Peter Geibel

claim paper

Read More »

132

click to vote

KCAP
2009
ACM

171views Information Technology» more KCAP 2009»

Interactively shaping agents via human reinforcement: the TAMER framework

15 years 9 months ago

Download userweb.cs.utexas.edu

As computational learning agents move into domains that incur real costs (e.g., autonomous driving or ﬁnancial investment), it will be necessary to learn good policies without n...

W. Bradley Knox, Peter Stone

claim paper

Read More »

116

click to vote

AAAI
1998

122views Intelligent Agents» more AAAI 1998»

A Framework for Reinforcement Learning on Real Robots

15 years 4 months ago

Download www.cs.wustl.edu

Learning on real robots in an real, unaltered environment provides an extremely challenging problem. Many of the simplifying assumptions made in other areas of learning cannot be ...

William D. Smart, Leslie Pack Kaelbling

claim paper

Read More »

« Prev « First page 64 / 869 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers