We propose strategies to efficiently execute a query workload, which consists of multiple related queries submitted against a scientific dataset, on a distributed-memory system in...
We present the Deep Store archival storage architecture, a large-scale storage system that stores immutable data efficiently and reliably for long periods of time. Archived data i...
Lawrence You, Kristal T. Pollack, Darrell D. E. Lo...
Abstract. Due to the high availability of the Internet, many large crossorganization collaboration projects, such as SourceForge, grid systems etc., have emerged. One of the fundam...
Meng-Ru Lin, Ssu-Hsuan Lu, Tsung-Hsuan Ho, Peter L...
Existing keyword-search systems in relational databases require users to submit a complete query to compute answers. Often users feel "left in the dark" when they have l...
Many important problems involve clustering large datasets. Although naive implementations of clustering are computationally expensive, there are established efficient techniques f...