Search Sciweavers | Sciweavers

495 search results - page 57 / 99

» Approximation algorithms for budgeted learning problems

AAMAS
2010
Springer

158 views · Intelligent Agents

Coordinated learning in multiagent MDPs with infinite state-space

14 years 11 months ago

Download gaips.inesc-id.pt

In this paper we address the problem of simultaneous learning and coordination in multiagent Markov decision problems (MMDPs) with infinite state-spaces. We separate this ...

Francisco S. Melo, M. Isabel Ribeiro

TIP
2008

133 views

A Recursive Model-Reduction Method for Approximate Inference in Gaussian Markov Random Fields

14 years 11 months ago

Download projects.csail.mit.edu

This paper presents recursive cavity modeling--a principled, tractable approach to approximate, near-optimal inference for large Gauss-Markov random fields. The main idea is to su...

Jason K. Johnson, Alan S. Willsky

JMLR
2010

149 views

Learning Bayesian Network Structure using LP Relaxations

14 years 6 months ago

Download people.csail.mit.edu

We propose to solve the combinatorial problem of finding the highest scoring Bayesian network structure from data. This structure learning problem can be viewed as an inference pr...

Tommi Jaakkola, David Sontag, Amir Globerson, Mari...

KDD
2012
ACM

187 views · Data Mining

Online learning to diversify from implicit feedback

13 years 2 months ago

Download www.cs.cornell.edu

In order to minimize redundancy and optimize coverage of multiple user interests, search engines and recommender systems aim to diversify their set of results. To date, these dive...

Karthik Raman, Pannaga Shivaswamy, Thorsten Joachi...

ECML
2006
Springer

146 views · Machine Learning

Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

15 years 3 months ago

Download www.montefiore.ulg.ac.be

We target the problem of closed-loop learning of control policies that map visual percepts to continuous actions. Our algorithm, called Reinforcement Learning of Joint Classes (RLJ...

Sébastien Jodogne, Justus H. Piater
