Sciweavers

» On Random Sampling over Joins
PAKDD
2005
ACM
Data Mining
15 years 5 months ago
Progressive Sampling for Association Rules Based on Sampling Error Estimation
We explore in this paper a progressive sampling algorithm, called Sampling Error Estimation (SEE), which aims to identify an appropriate sample size for mining association rules. S...
Kun-Ta Chuang, Ming-Syan Chen, Wen-Chieh Yang
PVLDB
2008
14 years 11 months ago
Online maintenance of very large random samples on flash storage
Recent advances in flash media have made it an attractive alternative for data storage in a wide spectrum of computing devices, such as embedded sensors, mobile phones, PDAs...
Suman Nath, Phillip B. Gibbons
EDBT
2010
ACM
Database
15 years 3 months ago
Turbo-charging hidden database samplers with overflowing queries and skew reduction
Recently, there has been growing interest in random sampling from online hidden databases. These databases reside behind form-like web interfaces which allow users to execute sear...
Arjun Dasgupta, Nan Zhang 0004, Gautam Das
ICDE
2006
IEEE
Database
16 years 1 month ago
Materialized Sample Views for Database Approximation
We consider the problem of creating a sample view of a database table. A sample view is an indexed, materialized view that permits efficient sampling from an arbitrary range query...
Shantanu Joshi, Chris Jermaine
ICDE
2008
IEEE
Database
16 years 1 month ago
Self-Join Size Estimation in Large-scale Distributed Data Systems
In this work we tackle the open problem of self-join size (SJS) estimation in a large-scale Distributed Data System, where tuples of a relation are distributed over data nodes whic...
Theoni Pitoura, Peter Triantafillou