Sciweavers

151 search results - page 19 / 31
» Online Square Packing
Sort
View
ATAL
2009
Springer
15 years 4 months ago
Online exploration in least-squares policy iteration
One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...
Lihong Li, Michael L. Littman, Christopher R. Mans...
COLT
2006
Springer
15 years 1 months ago
A Randomized Online Learning Algorithm for Better Variance Control
We propose a sequential randomized algorithm, which at each step concentrates on functions having both low risk and low variance with respect to the previous step prediction functi...
Jean-Yves Audibert
CISS
2008
IEEE
15 years 4 months ago
Fusion frames and robust dimension reduction
Abstract— We consider the linear minimum meansquared error (LMMSE) estimation of a random vector of interest from its fusion frame measurements in presence noise and subspace era...
Ali Pezeshki, Gitta Kutyniok, A. Robert Calderbank
VTC
2006
IEEE
141views Communications» more  VTC 2006»
15 years 3 months ago
Minimum Distance-Based Limited-Feedback Precoder for MIMO Spatial Multiplexing Systems
— Precoding is a well-known method to reach the promised performance and capacity of multiple-input multipleoutput (MIMO) systems. Recent investigations, when the transmitter has...
Alireza Ghaderipoor, Chintha Tellambura
CORR
2007
Springer
77views Education» more  CORR 2007»
14 years 9 months ago
Quantization Bounds on Grassmann Manifolds of Arbitrary Dimensions and MIMO Communications with Feedback
— This paper considers the quantization problem on the Grassmann manifold with dimension n and p. The unique contribution is the derivation of a closed-form formula for the volum...
Wei Dai, Youjian Liu, Brian Rider