Sciweavers

180 search results - page 15 / 36
» On the Convergence Rate of Good-Turing Estimators
Sort
View
IJCAI
2001
14 years 12 months ago
Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning
Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...
Gregory Z. Grudic, Lyle H. Ungar
ICASSP
2011
IEEE
14 years 2 months ago
Empirical divergence maximization for quantizer design: An analysis of approximation error
Empirical divergence maximization is an estimation method similar to empirical risk minimization whereby the Kullback-Leibler divergence is maximized over a class of functions tha...
Michael A. Lexa
VTC
2008
IEEE
15 years 5 months ago
A Bit-Mapping Strategy for Joint Iterative Channel Estimation and Turbo-Decoding
Abstract— In this paper, we investigate Turbo-coded transmission over a temporally correlated flat Rayleigh fading channel. Conventionally, channel estimation is performed prior...
Susanne Godtmann, Helge Lüders, Gerd Ascheid,...
TNN
2008
82views more  TNN 2008»
14 years 10 months ago
Deterministic Learning for Maximum-Likelihood Estimation Through Neural Networks
In this paper, a general method for the numerical solution of maximum-likelihood estimation (MLE) problems is presented; it adopts the deterministic learning (DL) approach to find ...
Cristiano Cervellera, Danilo Macciò, Marco ...
MASCOTS
2008
14 years 12 months ago
Lifetime Estimation of Large IEEE 802.15.4 Compliant Wireless Sensor Networks
Lifetime of a wireless sensor network is affected by key factors such as network architecture, network size, sensor node population model, data generation rate, initial battery bu...
Carol Fung, Yanni Ellen Liu