Search Sciweavers | Sciweavers

1017 search results - page 49 / 204

» Constructive and collaborative learning of algorithms

163

click to vote

ICML
2005
IEEE

127views Machine Learning» more ICML 2005»

Exploration and apprenticeship learning in reinforcement learning

16 years 6 months ago

Download ai.stanford.edu

We consider reinforcement learning in systems with unknown dynamics. Algorithms such as E3 (Kearns and Singh, 2002) learn near-optimal policies by using "exploration policies...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

130

click to vote

ICML
2002
IEEE

155views Machine Learning» more ICML 2002»

Discovering Hierarchy in Reinforcement Learning with HEXQ

16 years 6 months ago

Download www.cs.berkeley.edu

An open problem in reinforcement learning is discovering hierarchical structure. HEXQ, an algorithm which automatically attempts to decompose and solve a model-free factored MDP h...

Bernhard Hengst

claim paper

Read More »

115

click to vote

ECML
2000
Springer

74views Machine Learning» more ECML 2000»

Layered Learning

15 years 9 months ago

Download www-lrn.cs.umass.edu

We examine how a network of many knowledge layers can be constructed in an on-line manner, such that the learned units represent building blocks of knowledge that serve to compres...

Peter Stone, Manuela M. Veloso

claim paper

Read More »

161

click to vote

ICML
2001
IEEE

164views Machine Learning» more ICML 2001»

Learning with the Set Covering Machine

16 years 6 months ago

Download www2.ift.ulaval.ca

We generalize the classical algorithms of Valiant and Haussler for learning conjunctions and disjunctions of Boolean attributes to the problem of learning these functions over arb...

Mario Marchand, John Shawe-Taylor

claim paper

Read More »

158

click to vote

ICML
2010
IEEE

200views Machine Learning» more ICML 2010»

Generalizing Apprenticeship Learning across Hypothesis Classes

15 years 6 months ago

Download paul.rutgers.edu

This paper develops a generalized apprenticeship learning protocol for reinforcementlearning agents with access to a teacher who provides policy traces (transition and reward obse...

Thomas J. Walsh, Kaushik Subramanian, Michael L. L...

claim paper

Read More »

« Prev « First page 49 / 204 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers