Sciweavers

SIGMOD
2004
ACM
Approximation Techniques for Spatial Data
Spatial Database Management Systems (SDBMS), e.g., Geographical Information Systems, that manage spatial objects such as points, lines, and hyper-rectangles, often have very high ...
Abhinandan Das, Johannes Gehrke, Mirek Riedewald
SIGMOD
2004
ACM
An Indexing Framework for Peer-to-Peer Systems
Adina Crainiceanu, Prakash Linga, Ashwin Machanava...
SIGMOD
2004
ACM
Diamond in the Rough: Finding Hierarchical Heavy Hitters in Multi-Dimensional Data
Data items archived in data warehouses or those that arrive online as streams typically have attributes which take values from multiple hierarchies (e.g., time and geographic loca...
Graham Cormode, Flip Korn, S. Muthukrishnan, Dives...
SIGMOD
2004
ACM
FARMER: Finding Interesting Rule Groups in Microarray Datasets
Microarray datasets typically contain a large number of columns but a small number of rows. Association rules have proven useful in analyzing such datasets. However, most e...
Gao Cong, Anthony K. H. Tung, Xin Xu, Feng Pan, Ji...
SIGMOD
2004
ACM
Querying at Internet-Scale
We are developing a distributed query processor called PIER, which is designed to run on the scale of the entire Internet. PIER utilizes a Distributed Hash Table (DHT) as its comm...
Brent N. Chun, Joseph M. Hellerstein, Ryan Huebsch...
SIGMOD
2004
ACM
Cost-Based Labeling of Groups of Mass Spectra
We make two main contributions in this paper. First, we motivate and introduce a novel class of data mining problems that arise in labeling a group of mass spectra, specifically f...
Lei Chen, Zheng Huang, Raghu Ramakrishnan
SIGMOD
2004
ACM
BLAS: An Efficient XPath Processing System
We present BLAS, a Bi-LAbeling based System, for efficiently processing complex XPath queries over XML data. BLAS uses P-labeling to process queries involving consecutive child ax...
Yi Chen, Susan B. Davidson, Yifeng Zheng
SIGMOD
2004
ACM
Effective Use of Block-Level Sampling in Statistics Estimation
Block-level sampling is far more efficient than true uniform-random sampling over a large database, but prone to significant errors if used to create database statistics. In this ...
Surajit Chaudhuri, Gautam Das, Utkarsh Srivastava
SIGMOD
2004
ACM
Automatic Categorization of Query Results
Exploratory ad-hoc queries could return too many answers, a phenomenon commonly referred to as "information overload". In this paper, we propose to automatically catego...
Kaushik Chakrabarti, Surajit Chaudhuri, Seung-won ...
SIGMOD
2004
ACM
Efficient Development of Data Migration Transformations
Paulo J. F. Carreira, Helena Galhardas