Search Sciweavers | Sciweavers

417 search results - page 40 / 84

» Reinforcement Learning Estimation of Distribution Algorithm

130

click to vote

ICML
2004
IEEE

122views Machine Learning» more ICML 2004»

Online learning of conditionally I.I.D. data

15 years 7 months ago

Download www.machinelearning.org

In this work we consider the task of relaxing the i.i.d assumption in online pattern recognition (or classiﬁcation), aiming to make existing learning algorithms applicable to a ...

Daniil Ryabko

claim paper

Read More »

145

Voted

DIS
2007
Springer

133views Theoretical Computer Science» more DIS 2007»

A Hilbert Space Embedding for Distributions

15 years 8 months ago

Download www.kyb.tuebingen.mpg.de

We describe a technique for comparing distributions without the need for density estimation as an intermediate step. Our approach relies on mapping the distributions into a reprodu...

Alexander J. Smola, Arthur Gretton, Le Song, Bernh...

claim paper

Read More »

135

click to vote

ATAL
2006
Springer

147views Intelligent Agents» more ATAL 2006»

Learning to cooperate in multi-agent social dilemmas

15 years 6 months ago

Download sequel.futurs.inria.fr

In many Multi-Agent Systems (MAS), agents (even if selfinterested) need to cooperate in order to maximize their own utilities. Most of the multi-agent learning algorithms focus on...

Jose Enrique Munoz de Cote, Alessandro Lazaric, Ma...

claim paper

Read More »

130

Voted

COLT
2005
Springer

93views Machine Learning» more COLT 2005»

Localized Upper and Lower Bounds for Some Estimation Problems

15 years 8 months ago

Download stat.rutgers.edu

Abstract. We derive upper and lower bounds for some statistical estimation problems. The upper bounds are established for the Gibbs algorithm. The lower bounds, applicable for all ...

Tong Zhang

claim paper

Read More »

110

click to vote

COLT
2004
Springer

130views Machine Learning» more COLT 2004»

Performance Guarantees for Regularized Maximum Entropy Density Estimation

15 years 7 months ago

Download www.cs.princeton.edu

Abstract. We consider the problem of estimating an unknown probability distribution from samples using the principle of maximum entropy (maxent). To alleviate overﬁtting with a v...

Miroslav Dudík, Steven J. Phillips, Robert ...

claim paper

Read More »

« Prev « First page 40 / 84 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers