Sciweavers

» Convergence of the Wake-Sleep Algorithm
APPROX
2006
Springer
15 years 7 months ago
A Randomized Solver for Linear Systems with Exponential Convergence
Abstract. The Kaczmarz method for solving linear systems of equations Ax = b is an iterative algorithm that has found many applications ranging from computer tomography to digital ...
Thomas Strohmer, Roman Vershynin
COLT
2004
Springer
15 years 9 months ago
Reinforcement Learning for Average Reward Zero-Sum Games
Abstract. We consider Reinforcement Learning for average reward zero-sum stochastic games. We present and analyze two algorithms. The first is based on relative Q-learning and the ...
Shie Mannor
COLT
2000
Springer
15 years 8 months ago
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning
We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (POMDP), and focus on gradient ascent approache...
Peter L. Bartlett, Jonathan Baxter
PIMRC
2008
IEEE
15 years 10 months ago
Iterative EM based channel estimation for KSP-OFDM
Abstract. This paper proposes a new iterative channel estimation algorithm for known symbol padding (KSP) Orthogonal Frequency Division Multiplexing (OFDM) based on the Expectatio...
Dieter Van Welden, Heidi Steendam
TSP
2010
14 years 10 months ago
Distributed consensus with quantized data via sequence averaging
The problem of distributed average consensus with quantized data is considered in this correspondence. Conventional consensus algorithms suffer from divergence when quantization er...
Jun Fang, Hongbin Li