Search Sciweavers | Sciweavers

3050 search results - page 105 / 610

» On-line Algorithms in Machine Learning

ICML
2007
IEEE

158 views | Machine Learning

A bound on the label complexity of agnostic active learning

16 years 4 months ago

Download www.cs.cmu.edu

We study the label complexity of pool-based active learning in the agnostic PAC model. Specifically, we derive general bounds on the number of label requests made by the A2 algori...

Steve Hanneke

ICML
2002
IEEE

155 views | Machine Learning

Discovering Hierarchy in Reinforcement Learning with HEXQ

16 years 4 months ago

Download www.cs.berkeley.edu

An open problem in reinforcement learning is discovering hierarchical structure. HEXQ, an algorithm which automatically attempts to decompose and solve a model-free factored MDP h...

Bernhard Hengst

ECML
2000
Springer

74 views | Machine Learning

Layered Learning

15 years 8 months ago

Download www-lrn.cs.umass.edu

We examine how a network of many knowledge layers can be constructed in an on-line manner, such that the learned units represent building blocks of knowledge that serve to compres...

Peter Stone, Manuela M. Veloso

EWRL
2008

121 views | Machine Learning

Variable Metric Reinforcement Learning Methods Applied to the Noisy Mountain Car Problem

15 years 5 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

Two variable metric reinforcement learning methods, the natural actor-critic algorithm and the covariance matrix adaptation evolution strategy, are compared on a conceptual level a...

Verena Heidrich-Meisner, Christian Igel

ECML
2007
Springer

170 views | Machine Learning

Sequence Labeling with Reinforcement Learning and Ranking Algorithms

15 years 5 months ago

Download nieme.lip6.fr

Many problems in areas such as Natural Language Processing, Information Retrieval, or Bioinformatics involve the generic task of sequence labeling. In many cases, the aim is to assi...

Francis Maes, Ludovic Denoyer, Patrick Gallinari
