Sciweavers

2711 search results - page 46 / 543
» Convergence of the Wake-Sleep Algorithm
Sort
View
NIPS
2007
15 years 3 months ago
Fixing Max-Product: Convergent Message Passing Algorithms for MAP LP-Relaxations
We present a novel message passing algorithm for approximating the MAP problem in graphical models. The algorithm is similar in structure to max-product but unlike max-product it ...
Amir Globerson, Tommi Jaakkola
AAIM
2007
Springer
119views Algorithms» more  AAIM 2007»
15 years 5 months ago
An Efficient, and Fast Convergent Algorithm for Barrier Options
Tian-Shyr Dai, Yuh-Dauh Lyuu
NIPS
1998
15 years 2 months ago
Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms
In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...
Michael J. Kearns, Satinder P. Singh
ICML
1995
IEEE
16 years 2 months ago
Residual Algorithms: Reinforcement Learning with Function Approximation
A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...
Leemon C. Baird III