Sciweavers

2711 search results - page 59 / 543
» Convergence of the Wake-Sleep Algorithm
Sort
View
137
Voted
ATAL
2008
Springer
15 years 5 months ago
Using adaptive consultation of experts to improve convergence rates in multiagent learning
In this paper we study the use of experts algorithms in a multiagent setting. In this paper we allow agents to use multiple experts and explore different experts algorithms that a...
Greg Hines, Kate Larson
147
Voted
IJCAI
2001
15 years 5 months ago
Rational and Convergent Learning in Stochastic Games
This paper investigates the problem of policy learning in multiagent environments using the stochastic game framework, which we briefly overview. We introduce two properties as de...
Michael H. Bowling, Manuela M. Veloso
135
Voted
GECCO
2008
Springer
152views Optimization» more  GECCO 2008»
15 years 4 months ago
Designing EDAs by using the elitist convergent EDA concept and the boltzmann distribution
This paper presents a theoretical definition for designing EDAs called Elitist Convergent Estimation of Distribution Algorithm (ECEDA), and a practical implementation: the Boltzm...
Sergio Ivvan Valdez Peña, Arturo Hern&aacut...
138
Voted
EMMCVPR
2001
Springer
15 years 8 months ago
A Double-Loop Algorithm to Minimize the Bethe Free Energy
Recent work (Yedidia, Freeman, Weiss [22]) has shown that stable points of belief propagation (BP) algorithms [12] for graphs with loops correspond to extrema of the Bethe free ene...
Alan L. Yuille
ML
2007
ACM
192views Machine Learning» more  ML 2007»
15 years 3 months ago
Annealing stochastic approximation Monte Carlo algorithm for neural network training
We propose a general-purpose stochastic optimization algorithm, the so-called annealing stochastic approximation Monte Carlo (ASAMC) algorithm, for neural network training. ASAMC c...
Faming Liang