Sciweavers

VLDB 2009, ACM
Swoosh: a generic approach to entity resolution
Omar Benjelloun, Hector Garcia-Molina, David Menes...
VLDB 2009, ACM
Guessing the extreme values in a data set: a Bayesian method and its applications
For a large number of data management problems, it would be very useful to be able to obtain a few samples from a data set, and to use the samples to guess the largest (or smallest)...
Mingxi Wu, Chris Jermaine
VLDB 2009, ACM
Sampling-based estimators for subset-based queries
We consider the problem of using sampling to estimate the result of an aggregation operation over a subset-based SQL query, where a subquery is correlated to an outer query by a NO...
Shantanu Joshi, Christopher M. Jermaine
VLDB 2009, ACM
Privacy-preserving indexing of documents on the network
We address the problem of providing privacy-preserving search over distributed access-controlled content. Indexed documents can be easily reconstructed from conventional (inverted) ...
Mayank Bawa, Rakesh Agrawal, Roberto J. Bayardo Jr...
VLDB 2009, ACM
Anytime measures for top-k algorithms on exact and fuzzy data sets
Top-k queries on large multi-attribute data sets are fundamental operations in information retrieval and ranking applications. In this article, we initiate research on the anytime ...
Benjamin Arai, Gautam Das, Dimitrios Gunopulos, Ni...