Sciweavers

1768 search results - page 240 / 354
» Mining Very Large Databases
Sort
View
SIGMOD
2004
ACM
118views Database» more  SIGMOD 2004»
16 years 3 months ago
Effective Use of Block-Level Sampling in Statistics Estimation
Block-level sampling is far more efficient than true uniform-random sampling over a large database, but prone to significant errors if used to create database statistics. In this ...
Surajit Chaudhuri, Gautam Das, Utkarsh Srivastava
124
Voted
VLDB
2007
ACM
106views Database» more  VLDB 2007»
15 years 9 months ago
Why You Should Run TPC-DS: A Workload Analysis
The Transaction Processing Performance Council (TPC) is completing development of TPC-DS, a new generation industry standard decision support benchmark. The TPC-DS benchmark, firs...
Meikel Pöss, Raghunath Othayoth Nambiar, Davi...
118
Voted
ECCB
2005
IEEE
15 years 9 months ago
SIMAP - The similarity matrix of proteins
Similarity Matrix of Proteins (SIMAP) (http://mips.gsf. 10 de/simap) provides a database based on a precomputed similarity matrix covering the similarity space formed by .4 millio...
Roland Arnold, Thomas Rattei, Patrick Tischler, Mi...
132
Voted
SIGMOD
2010
ACM
321views Database» more  SIGMOD 2010»
15 years 8 months ago
HadoopDB in action: building real world applications
HadoopDB is a hybrid of MapReduce and DBMS technologies, designed to meet the growing demand of analyzing massive datasets on very large clusters of machines. Our previous work ha...
Azza Abouzied, Kamil Bajda-Pawlikowski, Jiewen Hua...
146
Voted
ADBIS
2003
Springer
204views Database» more  ADBIS 2003»
15 years 7 months ago
Hierarchical Bitmap Index: An Efficient and Scalable Indexing Technique for Set-Valued Attributes
Abstract. Set-valued attributes are convenient to model complex objects occurring in the real world. Currently available database systems support the storage of set-valued attribut...
Mikolaj Morzy, Tadeusz Morzy, Alexandros Nanopoulo...