Search Sciweavers | Sciweavers

197

ICML
2004
IEEE

145views Machine Learning» more ICML 2004»

Convergence of synchronous reinforcement learning with linear function approximation

16 years 8 months ago

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

claim paper

Read More »

208

click to vote

ICML
2005
IEEE

144views Machine Learning» more ICML 2005»

Healing the relevance vector machine through augmentation

16 years 8 months ago

Download www.machinelearning.org

The Relevance Vector Machine (RVM) is a sparse approximate Bayesian kernel method. It provides full predictive distributions for test cases. However, the predictive uncertainties ...

Carl Edward Rasmussen, Joaquin Quiñonero Ca...

claim paper

Read More »

195

click to vote

ICML
2002
IEEE

215views Machine Learning» more ICML 2002»

Combining Labeled and Unlabeled Data for MultiClass Text Categorization

16 years 8 months ago

Download www.accenture.com

Supervised learning techniques for text classi cation often require a large number of labeled examples to learn accurately. One way to reduce the amountoflabeled datarequired is t...

Rayid Ghani

claim paper

Read More »

207

click to vote

ICML
2002
IEEE

156views Machine Learning» more ICML 2002»

Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs

16 years 8 months ago

Download select.cs.cmu.edu

One of the central challenges in reinforcement learning is to balance the exploration/exploitation tradeoff while scaling up to large problems. Although model-based reinforcement ...

Carlos Guestrin, Relu Patrascu, Dale Schuurmans

claim paper

Read More »

229

click to vote

ICML
2002
IEEE

149views Machine Learning» more ICML 2002»

Learning the Kernel Matrix with Semi-Definite Programming

16 years 8 months ago

Download www.support-vector.net

Kernel-based learning algorithms work by embedding the data into a Euclidean space, and then searching for linear relations among the embedded data points. The embedding is perfor...

Gert R. G. Lanckriet, Nello Cristianini, Peter L. ...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers