ICML 2002 | Sciweavers

An open problem in reinforcement learning is discovering hierarchical structure. HEXQ, an algorithm which automatically attempts to decompose and solve a model-free factored MDP h...

Bernhard Hengst

ICML
2002
IEEE

Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs

14 years 5 months ago

Download select.cs.cmu.edu

One of the central challenges in reinforcement learning is to balance the exploration/exploitation tradeoff while scaling up to large problems. Although model-based reinforcement ...

Carlos Guestrin, Relu Patrascu, Dale Schuurmans

Coordinated Reinforcement Learning

14 years 5 months ago

Download select.cs.cmu.edu

We present several new algorithms for multiagent reinforcement learning. A common feature of these algorithms is a parameterized, structured representation of a policy or value fu...

Carlos Guestrin, Michail G. Lagoudakis, Ronald Par...

Hierarchically Optimal Average Reward Reinforcement Learning

14 years 5 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

Combining Labeled and Unlabeled Data for MultiClass Text Categorization

14 years 5 months ago

Download www.accenture.com

Supervised learning techniques for text classification often require a large number of labeled examples to learn accurately. One way to reduce the amount of labeled data required is t...

Rayid Ghani

Multi-Instance Kernels

14 years 5 months ago

Download sci2s.ugr.es

Learning from structured data is becoming increasingly important. However, most prior work on kernel methods has focused on learning from attribute-value data. Only recently, rese...

Adam Kowalczyk, Alex J. Smola, Peter A. Flach, Tho...

On generalization bounds, projection profile, and margin distribution

14 years 5 months ago

Download valis.cs.uiuc.edu

We study generalization properties of linear learning algorithms and develop a data dependent approach that is used to derive generalization bounds that depend on the margin distr...

Ashutosh Garg, Sariel Har-Peled, Dan Roth

Univariate Polynomial Inference by Monte Carlo Message Length Approximation

14 years 5 months ago

Download www.csse.monash.edu.au

We apply the Message from Monte Carlo (MMC) algorithm to inference of univariate polynomials. MMC is an algorithm for point estimation from a Bayesian posterior sample. It partiti...

Leigh J. Fitzgibbon, David L. Dowe, Lloyd Allison
