Search Sciweavers | Sciweavers


ICML
2009
IEEE


Bandit-based optimization on graphs with application to library performance tuning

16 years 7 months ago

The problem of choosing fast implementations for a class of recursive algorithms such as the fast Fourier transforms can be formulated as an optimization problem over the language...

Arpad Rimmel, Frédéric de Mesmay, Ma...


ICML
2007
IEEE


Bottom-up learning of Markov logic network structure

16 years 7 months ago

Download www.machinelearning.org

Markov logic networks (MLNs) are a statistical relational model that consists of weighted first-order clauses and generalizes first-order logic and Markov networks. The current sta...

Lilyana Mihalkova, Raymond J. Mooney


ICML
2006
IEEE


Using inaccurate models in reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng


ICML
2006
IEEE


Bayesian pattern ranking for move prediction in the game of Go

16 years 7 months ago

Download research.microsoft.com

We investigate the problem of learning to predict moves in the board game of Go from game records of expert players. In particular, we obtain a probability distribution over legal...

David H. Stern, Ralf Herbrich, Thore Graepel


ICML
1995
IEEE


Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem

16 years 7 months ago

Download www.idsia.ch

In this paper we introduce Ant-Q, a family of algorithms which present many similarities with Q-learning (Watkins, 1989), and which we apply to the solution of symmetric and asymm...

Luca Maria Gambardella, Marco Dorigo
