Sciweavers

109 search results - page 21 / 22
» Evaluating top-k queries over incomplete data streams
Sort
View
DKE
2008
109views more  DKE 2008»
14 years 9 months ago
Deterministic algorithms for sampling count data
Processing and extracting meaningful knowledge from count data is an important problem in data mining. The volume of data is increasing dramatically as the data is generated by da...
Hüseyin Akcan, Alex Astashyn, Hervé Br...
91
Voted
LREC
2008
112views Education» more  LREC 2008»
14 years 11 months ago
A Ground Truth Dataset for Matching Culturally Diverse Romanized Person Names
This paper describes the development of a ground truth dataset of culturally diverse Romanized names in which approximately 70,000 names are matched against a subset of 700. We ra...
Mark Arehart, Keith J. Miller
SIGMOD
2010
ACM
267views Database» more  SIGMOD 2010»
15 years 2 months ago
Processing proximity relations in road networks
Applications ranging from location-based services to multi-player online gaming require continuous query support to monitor, track, and detect events of interest among sets of mov...
Zhengdao Xu, Hans-Arno Jacobsen
USENIX
2007
14 years 12 months ago
Using Provenance to Aid in Personal File Search
As the scope of personal data grows, it becomes increasingly difficult to find what we need when we need it. Desktop search tools provide a potential answer, but most existing too...
Sam Shah, Craig A. N. Soules, Gregory R. Ganger, B...
ICDE
2007
IEEE
116views Database» more  ICDE 2007»
15 years 11 months ago
MultiMap: Preserving disk locality for multidimensional datasets
MultiMap is an algorithm for mapping multidimensional datasets so as to preserve the data's spatial locality on disks. Without revealing disk-specific details to applications...
Minglong Shao, Steven W. Schlosser, Stratos Papado...