Sciweavers

» Generalization Error Bounds Using Unlabeled Data
KDD
2000
ACM
15 years 1 month ago
Mining high-speed data streams
Many organizations today have more than very large databases; they have databases that grow without limit at a rate of several million records per day. Mining these continuous dat...
Pedro Domingos, Geoff Hulten
ICASSP
2010
IEEE
14 years 10 months ago
Discriminative training methods for language models using conditional entropy criteria
This paper addresses the problem of discriminative training of language models that does not require any transcribed acoustic data. We propose to minimize the conditional entropy ...
Jui-Ting Huang, Xiao Li, Alex Acero
BMCBI
2008
14 years 10 months ago
Missing value imputation for microarray gene expression data using histone acetylation information
Background: It is an important pre-processing step to accurately estimate missing values in microarray data, because complete datasets are required in numerous expression profile ...
Qian Xiang, Xianhua Dai, Yangyang Deng, Caisheng H...
SSDBM
2003
IEEE
15 years 3 months ago
A Quad-Tree Based Multiresolution Approach for Two-dimensional Summary Data
In many application contexts, like statistical databases, scientific databases, query optimizers, OLAP, and so on, data are often summarized into synopses of aggregate values. Su...
Francesco Buccafurri, Filippo Furfaro, Domenico Sa...
HRI
2006
ACM
15 years 3 months ago
Using context and sensory data to learn first and second person pronouns
We present a method of grounded word learning that is powerful enough to learn the meanings of first and second person pronouns. The model uses the understood words in an utteran...
Kevin Gold, Brian Scassellati