Sciweavers

2711 search results - page 437 / 543
» Convergence of the Wake-Sleep Algorithm
Sort
View
ATAL
2010
Springer
14 years 11 months ago
Self-organization for coordinating decentralized reinforcement learning
Decentralized reinforcement learning (DRL) has been applied to a number of distributed applications. However, one of the main challenges faced by DRL is its convergence. Previous ...
Chongjie Zhang, Victor R. Lesser, Sherief Abdallah
CEC
2010
IEEE
14 years 11 months ago
A mixed strategy for Evolutionary Programming based on local fitness landscape
The performance of Evolutionary Programming (EP) is affected by many factors (e.g. mutation operators and selection strategies). Although the conventional approach with Gaussian mu...
Liang Shen, Jun He
CORR
2010
Springer
127views Education» more  CORR 2010»
14 years 10 months ago
Mean field for Markov Decision Processes: from Discrete to Continuous Optimization
We study the convergence of Markov Decision Processes made of a large number of objects to optimization problems on ordinary differential equations (ODE). We show that the optimal...
Nicolas Gast, Bruno Gaujal, Jean-Yves Le Boudec
CORR
2008
Springer
149views Education» more  CORR 2008»
14 years 10 months ago
Gaussian Belief Propagation for Solving Systems of Linear Equations: Theory and Application
The canonical problem of solving a system of linear equations arises in numerous contexts in information theory, communication theory, and related fields. In this contribution, we...
Ori Shental, Danny Bickson, Paul H. Siegel, Jack K...
IJRR
2006
103views more  IJRR 2006»
14 years 10 months ago
Adaptive Tracking Control for Robots with Unknown Kinematic and Dynamic Properties
It has been almost two decades since the first globally tracking convergent adaptive controllers were derived for robot with dynamic uncertainties. However, the problem of concurr...
Chien-Chern Cheah, Chao Liu 0003, Jean-Jacques E. ...