Sciweavers

1950 search results - page 270 / 390
» Informative sampling for large unbalanced data sets
SIGIR
2002
ACM
15 years 3 months ago
Cross-document summarization by concept classification
In this paper we describe a Cross Document Summarizer XDoX designed specifically to summarize large document sets (50-500 documents and more). Such sets of documents are typically...
Hilda Hardy, Nobuyuki Shimizu, Tomek Strzalkowski,...
ESORICS
2005
Springer
15 years 9 months ago
Privacy Preserving Clustering
The freedom and transparency of information flow on the Internet has heightened concerns of privacy. Given a set of data items, clustering algorithms group similar items together...
Somesh Jha, Louis Kruger, Patrick McDaniel
CSL
2006
Springer
15 years 3 months ago
Unsupervised grammar induction using history based approach
Grammar induction, also known as grammar inference, is one of the most important research areas in the domain of natural language processing. Availability of large corpora has enc...
Heshaam Feili, Gholamreza Ghassem-Sani
PODS
2005
ACM
16 years 3 months ago
Estimating arbitrary subset sums with few probes
Suppose we have a large table T of items i, each with a weight w_i, e.g., people and their salary. In a general preprocessing step for estimating arbitrary subset sums, we assign e...
Noga Alon, Nick G. Duffield, Carsten Lund, Mikkel ...
SSDBM
2005
IEEE
15 years 9 months ago
Fuzzy Decomposition of Spatially Extended Objects
Modern database applications including computer-aided design, multimedia information systems, medical imaging, molecular biology, or geographical information systems impose new re...
Hans-Peter Kriegel, Martin Pfeifle