Sciweavers

2711 search results - page 424 / 543
» Convergence of the Wake-Sleep Algorithm
Sort
View
ATAL
2008
Springer
15 years 10 hour ago
Shared focus of attention for heterogeneous agents
A network of cooperating agents must be able to reach rough consensus on a set of topics for cooperation. With highly heterogeneous agents, however, incommensurable measures and i...
Jacob Beal
CDC
2008
IEEE
15 years 4 hour ago
Distributed estimation and control for stochastically interacting robots
Abstract-- We introduce a distributed estimation algorithm for use by a collection of stochastically interacting agents. Each agent has both a discrete value and an estimate of the...
Fayette W. Shaw, Eric Klavins
COLT
2005
Springer
14 years 12 months ago
Loss Bounds for Online Category Ranking
Category ranking is the task of ordering labels with respect to their relevance to an input instance. In this paper we describe and analyze several algorithms for online category r...
Koby Crammer, Yoram Singer
COLT
2008
Springer
14 years 11 months ago
Learning Coordinate Gradients with Multi-Task Kernels
Coordinate gradient learning is motivated by the problem of variable selection and determining variable covariation. In this paper we propose a novel unifying framework for coordi...
Yiming Ying, Colin Campbell
ECAI
2008
Springer
14 years 11 months ago
Exploiting locality of interactions using a policy-gradient approach in multiagent learning
In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...
Francisco S. Melo