Search Sciweavers | Sciweavers

12194 search results - page 53 / 2439

» Numberings Optimal for Learning

129

click to vote

DAM
2007

84views more DAM 2007»

Estimates of covering numbers of convex sets with slowly decaying orthogonal subsets

15 years 4 months ago

Download www.dist.unige.it

Covering numbers of precompact symmetric convex subsets of Hilbert spaces are investigated. Lower bounds are derived for sets containing orthogonal subsets with norms of their ele...

Vera Kurková, Marcello Sanguineti

claim paper

Read More »

160

click to vote

GECCO
2009
Springer

162views Optimization» more GECCO 2009»

Uncertainty handling CMA-ES for reinforcement learning

15 years 2 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMAES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

118

click to vote

ICML
2009
IEEE

189views Machine Learning» more ICML 2009»

A simpler unified analysis of budget perceptrons

16 years 5 months ago

Download www.cs.utoronto.ca

The kernel Perceptron is an appealing online learning algorithm that has a drawback: whenever it makes an error it must increase its support set, which slows training and testing ...

Ilya Sutskever

claim paper

Read More »

123

click to vote

CSDA
2006

94views more CSDA 2006»

Signal extraction for simulated games with a large number of players

15 years 4 months ago

Download www.helsinki.fi

A signal extraction problem in simulated games is studied. A modelling technique is proposed for deriving beliefs for players in simulated games. Since standard Bayesian games pro...

Aki Lehtinen

claim paper

Read More »

143

click to vote

ICML
2009
IEEE

186views Machine Learning» more ICML 2009»

Regularization and feature selection in least-squares temporal difference learning

16 years 5 months ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

« Prev « First page 53 / 2439 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers