SIGMOD
2001
ACM
Improving Index Performance through Prefetching
This paper proposes and evaluates Prefetching B+-Trees (pB+-Trees), which use prefetching to accelerate two important operations on B+-Tree indices: searches and range scans. To ...
Shimin Chen, Phillip B. Gibbons, Todd C. Mowry
Query Optimization In Compressed Database Systems
Over the last decades, improvements in CPU speed have outpaced improvements in main memory and disk access rates by orders of magnitude, enabling the use of data compression techn...
Zhiyuan Chen, Johannes Gehrke, Flip Korn
A Robust, Optimization-Based Approach for Approximate Answering of Aggregate Queries
The ability to approximately answer aggregation queries accurately and efficiently is of great benefit for decision support and data mining tools. In contrast to previous sampling...
Surajit Chaudhuri, Gautam Das, Vivek R. Narasayya
Enabling Dynamic Content Caching for Database-Driven Web Sites
K. Selçuk Candan, Wen-Syan Li, Qiong Luo, W...
STHoles: A Multidimensional Workload-Aware Histogram
Nicolas Bruno, Surajit Chaudhuri, Luis Gravano
Data Bubbles: Quality Preserving Performance Boosting for Hierarchical Clustering
In this paper, we investigate how to scale hierarchical clustering methods (such as OPTICS) to extremely large databases by utilizing data compression methods (such as BIRCH or ra...
Markus M. Breunig, Hans-Peter Kriegel, Peer Krö...
Automatic Segmentation of Text into Structured Records
In this paper we present a method for automatically segmenting unformatted text records into structured elements. Several useful data sources today are human-generated as continuo...
Vinayak R. Borkar, Kaustubh Deshmukh, Sunita Saraw...
Epsilon Grid Order: An Algorithm for the Similarity Join on Massive High-Dimensional Data
The similarity join is an important database primitive which has been successfully applied to speed up applications such as similarity search, data analysis and data mining. The s...
Christian Böhm, Bernhard Braunmüller, Fl...
Outlier Detection for High Dimensional Data
The outlier detection problem has important applications in the field of fraud detection, network robustness analysis, and intrusion detection. Most such applications are high dimen...
Charu C. Aggarwal, Philip S. Yu