Search Sciweavers | Sciweavers

417 search results - page 27 / 84

» Reinforcement Learning Estimation of Distribution Algorithm

130

click to vote

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 9 days ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

112

click to vote

ICML
2010
IEEE

213views Machine Learning» more ICML 2010»

Nonparametric Information Theoretic Clustering Algorithm

15 years 2 months ago

Download www.icml2010.org

In this paper we propose a novel clustering algorithm based on maximizing the mutual information between data points and clusters. Unlike previous methods, we neither assume the d...

Lev Faivishevsky, Jacob Goldberger

claim paper

Read More »

122

click to vote

ICML
2004
IEEE

163views Machine Learning» more ICML 2004»

Multi-task feature and kernel selection for SVMs

16 years 3 months ago

Download www1.cs.columbia.edu

We compute a common feature selection or kernel selection configuration for multiple support vector machines (SVMs) trained on different yet inter-related datasets. The method is ...

Tony Jebara

claim paper

Read More »

171

click to vote

SGAI
2010
Springer

226views Artificial Intelligence» more SGAI 2010»

Hierarchical Traces for Reduced NSM Memory Requirements

15 years 5 days ago

Download staff.newport.ac.uk

This paper presents work on using hierarchical long term memory to reduce the memory requirements of nearest sequence memory (NSM) learning, a previously published, instance-based ...

Torbjørn S. Dahl

claim paper

Read More »

115

click to vote

ICML
2010
IEEE

258views Machine Learning» more ICML 2010»

Feature Selection as a One-Player Game

15 years 3 months ago

Download www.lri.fr

This paper formalizes Feature Selection as a Reinforcement Learning problem, leading to a provably optimal though intractable selection policy. As a second contribution, this pape...

Romaric Gaudel, Michèle Sebag

claim paper

Read More »

« Prev « First page 27 / 84 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers