Search Sciweavers | Sciweavers

84

ICML
2003
IEEE

151views Machine Learning» more ICML 2003»

16 years 24 days ago

Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

88

click to vote

ICML
2003
IEEE

146views Machine Learning» more ICML 2003»

Learning with Positive and Unlabeled Examples Using Weighted Logistic Regression

16 years 24 days ago

Download www.cs.uic.edu

The problem of learning with positive and unlabeled examples arises frequently in retrieval applications. We transform the problem into a problem of learning with noise by labelin...

Wee Sun Lee, Bing Liu

claim paper

Read More »

85

Voted

ICML
2003
IEEE

114views Machine Learning» more ICML 2003»

Unsupervised Learning with Permuted Data

16 years 24 days ago

Download www.hpl.hp.com

We consider the problem of unsupervised learning from a matrix of data vectors where in each row the observed values are randomly permuted in an unknown fashion. Such problems ari...

Sergey Kirshner, Sridevi Parise, Padhraic Smyth

claim paper

Read More »

88

click to vote

ICML
2003
IEEE

140views Machine Learning» more ICML 2003»

Finding Underlying Connections: A Fast Graph-Based Method for Link Analysis and Collaboration Queries

16 years 24 days ago

Download www.cs.cmu.edu

Many techniques in the social sciences and graph theory deal with the problem of examining and analyzing patterns found in the underlying structure and associations of a group of ...

Jeremy Kubica, Andrew W. Moore, David Cohn, Jeff G...

claim paper

Read More »

89

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Planning in the Presence of Cost Functions Controlled by an Adversary

16 years 24 days ago

Download www.cs.cmu.edu

We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a rob...

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers