Search Sciweavers | Sciweavers

2711 search results - page 424 / 543

» Convergence of the Wake-Sleep Algorithm


ATAL
2008
Springer

114 views, Intelligent Agents

Shared focus of attention for heterogeneous agents

15 years 3 months ago

Download web.mit.edu

A network of cooperating agents must be able to reach rough consensus on a set of topics for cooperation. With highly heterogeneous agents, however, incommensurable measures and i...

Jacob Beal


CDC
2008
IEEE

80 views, Control Systems

Distributed estimation and control for stochastically interacting robots

15 years 3 months ago

Download soslab.ee.washington.edu

We introduce a distributed estimation algorithm for use by a collection of stochastically interacting agents. Each agent has both a discrete value and an estimate of the...

Fayette W. Shaw, Eric Klavins


COLT
2005
Springer

93 views, Machine Learning

Loss Bounds for Online Category Ranking

15 years 3 months ago

Download www.cis.upenn.edu

Category ranking is the task of ordering labels with respect to their relevance to an input instance. In this paper we describe and analyze several algorithms for online category r...

Koby Crammer, Yoram Singer


COLT
2008
Springer

143 views, Machine Learning

Learning Coordinate Gradients with Multi-Task Kernels

15 years 3 months ago

Download colt2008.cs.helsinki.fi

Coordinate gradient learning is motivated by the problem of variable selection and determining variable covariation. In this paper we propose a novel unifying framework for coordi...

Yiming Ying, Colin Campbell


ECAI
2008
Springer

124 views, Artificial Intelligence

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

15 years 3 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo
