Sciweavers

160 search results - page 26 / 32
» Towards effective partition management for large graphs
DOLAP
1999
ACM
15 years 2 months ago
A Cache Filtering Optimisation for Queries to Massive Datasets on Tertiary Storage
We consider a system in which many users run queries to examine subsets of a large object set. The object set is partitioned into files on tape. A single subset of objects will b...
Koen Holtman, Peter van der Stok, Ian Willers
CAISE
2007
Springer
15 years 4 months ago
Declarative XML Data Cleaning with XClean
Data cleaning is the process of correcting anomalies in a data source, that may for instance be due to typographical errors, or duplicate representations of an entity. It is a cruc...
Melanie Weis, Ioana Manolescu
OSDI
2008
ACM
15 years 10 months ago
Experiences with Content Addressable Storage and Virtual Disks
Efficiently managing storage is important for virtualized computing environments. Its importance is magnified by developments such as cloud computing which consolidate many thousa...
Anthony Liguori, Eric Van Hensbergen
SIGMOD
2009
ACM
15 years 10 months ago
Vispedia: on-demand data integration for interactive visualization and exploration
Wikipedia is an example of the large, collaborative, semi-structured data sets emerging on the Web. Typically, before these data sets can be used, they must be transformed into struc...
Bryan Chan, Justin Talbot, Leslie Wu, Nathan Sakun...
VLDB
1999
ACM
15 years 2 months ago
Capturing and Querying Multiple Aspects of Semistructured Data
Motivated to a large extent by the substantial and growing prominence of the World-Wide Web and the potential benefits that may be obtained by applying database concepts and tech...
Curtis E. Dyreson, Michael H. Böhlen, Christi...