Sciweavers

» Generalization Error Bounds Using Unlabeled Data
KDD
2000
ACM
Mining high-speed data streams
Many organizations today have more than very large databases; they have databases that grow without limit at a rate of several million records per day. Mining these continuous dat...
Pedro Domingos, Geoff Hulten
ICASSP
2010
IEEE
Discriminative training methods for language models using conditional entropy criteria
This paper addresses the problem of discriminative training of language models that does not require any transcribed acoustic data. We propose to minimize the conditional entropy ...
Jui-Ting Huang, Xiao Li, Alex Acero
BMCBI
2008
Missing value imputation for microarray gene expression data using histone acetylation information
Background: It is an important pre-processing step to accurately estimate missing values in microarray data, because complete datasets are required in numerous expression profile ...
Qian Xiang, Xianhua Dai, Yangyang Deng, Caisheng H...
SSDBM
2003
IEEE
A Quad-Tree Based Multiresolution Approach for Two-dimensional Summary Data
In many application contexts, like statistical databases, scientific databases, query optimizers, OLAP, and so on, data are often summarized into synopses of aggregate values. Su...
Francesco Buccafurri, Filippo Furfaro, Domenico Sa...
HRI
2006
ACM
Using context and sensory data to learn first and second person pronouns
We present a method of grounded word learning that is powerful enough to learn the meanings of first and second person pronouns. The model uses the understood words in an utteran...
Kevin Gold, Brian Scassellati