Sciweavers

» Convergence of the Wake-Sleep Algorithm
ICML
2001
IEEE
16 years 5 months ago
Off-Policy Temporal Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
OPODIS
2010
15 years 2 months ago
Biased Selection for Building Small-World Networks
Small-world networks are currently present in many distributed applications and can be built by augmenting a base network with long-range links using a probability distribut...
Andrés Sevilla, Alberto Mozo, M. Araceli Lo...
CVPR
2004
IEEE
16 years 7 months ago
Multiple Kernel Tracking with SSD
Kernel-based objective functions optimized using the mean shift algorithm have been demonstrated as an effective means of tracking in video sequences. The resulting algorithms com...
Gregory D. Hager, Maneesh Dewan, Charles V. Stewar...
ICIP
2002
IEEE
16 years 6 months ago
Image restoration under wavelet-domain priors: an expectation-maximization approach
This paper describes an expectation-maximization (EM) algorithm for wavelet-based image restoration (deconvolution). The observed image is assumed to be a convolved (e.g., blurred...
Robert D. Nowak, Mário A. T. Figueiredo
ICML
2009
IEEE
16 years 5 months ago
Proximal regularization for online and batch learning
Many learning algorithms rely on the curvature (in particular, strong convexity) of regularized objective functions to provide good theoretical performance guarantees. In practice...
Chuong B. Do, Quoc V. Le, Chuan-Sheng Foo