Sciweavers

2614 search results - page 255 / 523
» Customizable Data Distribution for Shared Data Spaces
Sort
View
FAST
2011
14 years 8 months ago
A Study of Practical Deduplication
We collected file system content data from 857 desktop computers at Microsoft over a span of 4 weeks. We analyzed the data to determine the relative efficacy of data deduplication...
Dutch T. Meyer, William J. Bolosky
SISAP
2008
IEEE
188views Data Mining» more  SISAP 2008»
15 years 11 months ago
High-Dimensional Similarity Retrieval Using Dimensional Choice
There are several pieces of information that can be utilized in order to improve the efficiency of similarity searches on high-dimensional data. The most commonly used information...
Dave Tahmoush, Hanan Samet
MSS
2005
IEEE
138views Hardware» more  MSS 2005»
15 years 10 months ago
EOSDIS Petabyte Archives: Tenth Anniversary
One of the world’s largest scientific data systems, NASA’s Earth Observing System Data and Information System (EOSDIS) has stored over three petabytes of earth science data in...
Jeanne Behnke, Tonjua Hines Watts, Ben Kobler, Daw...
FAST
2010
15 years 7 months ago
Provenance for the Cloud
The cloud is poised to become the next computing environment for both data storage and computation due to its pay-as-you-go and provision-as-you-go models. Cloud storage is alread...
Kiran-Kumar Muniswamy-Reddy, Peter Macko, Margo I....
ICDE
2005
IEEE
110views Database» more  ICDE 2005»
15 years 10 months ago
Locality Aware Networked Join Evaluation
We pose the question: how do we efficiently evaluate a join operator, distributed over a heterogeneous network? Our objective here is to optimize the delay of output tuples. We di...
Yanif Ahmad, Ugur Çetintemel, John Jannotti...