Search Sciweavers | Sciweavers

210 search results - page 27 / 42

» An analysis of reinforcement learning with function approxim...

111

click to vote

ML
2012
ACM

385views Machine Learning» more ML 2012»

An alternative view of variational Bayes and asymptotic approximations of free energy

13 years 7 months ago

Download hawaii.naist.jp

Bayesian learning, widely used in many applied data-modeling problems, is often accomplished with approximation schemes because it requires intractable computation of the posterio...

Kazuho Watanabe

claim paper

Read More »

click to vote

NIPS
1996

134views Information Technology» more NIPS 1996»

Why did TD-Gammon Work?

15 years 1 months ago

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

107

click to vote

CC
2010
Springer

120views System Software» more CC 2010»

Lower Bounds for Agnostic Learning via Approximate Rank

14 years 9 months ago

Download www.cs.utexas.edu

We prove that the concept class of disjunctions cannot be pointwise approximated by linear combinations of any small set of arbitrary real-valued functions. That is, suppose that t...

Adam R. Klivans, Alexander A. Sherstov

claim paper

Read More »

click to vote

GECCO
2006
Springer

142views Optimization» more GECCO 2006»

Classifier prediction based on tile coding

15 years 3 months ago

Download www.eskimo.com

This paper introduces XCSF extended with tile coding prediction: each classifier implements a tile coding approximator; the genetic algorithm is used to adapt both classifier cond...

Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...

claim paper

Read More »

click to vote

ICML
2010
IEEE

258views Machine Learning» more ICML 2010»

Feature Selection as a One-Player Game

15 years 24 days ago

Download www.lri.fr

This paper formalizes Feature Selection as a Reinforcement Learning problem, leading to a provably optimal though intractable selection policy. As a second contribution, this pape...

Romaric Gaudel, Michèle Sebag

claim paper

Read More »

« Prev « First page 27 / 42 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers