Sciweavers

» Efficient Discovery of Confounders in Large Data Sets
CLOUDCOM
2010
Springer
13 years 2 months ago
Efficient Metadata Generation to Enable Interactive Data Discovery over Large-Scale Scientific Data Collections
Discovering the correct dataset efficiently is critical for computations and effective simulations in scientific experiments. In contrast to searching web documents over the Intern...
Sangmi Lee Pallickara, Shrideep Pallickara, Milija...
AAAI
2011
12 years 4 months ago
Relational Blocking for Causal Discovery
Blocking is a technique commonly used in manual statistical analysis to account for confounding variables. However, blocking is not currently used in automated learning algorithms...
Matthew J. Rattigan, Marc E. Maier, David Jensen
PAKDD
2000
ACM
13 years 8 months ago
Performance Controlled Data Reduction for Knowledge Discovery in Distributed Databases
The objective of data reduction is to obtain a compact representation of a large data set to facilitate repeated use of non-redundant information with complex and slow learning alg...
Slobodan Vucetic, Zoran Obradovic
IDA
2011
Springer
12 years 11 months ago
A parallel, distributed algorithm for relational frequent pattern discovery from very large data sets
The amount of data produced by ubiquitous computing applications is quickly growing, due to the pervasive presence of small devices endowed with sensing, computing and communicatio...
Annalisa Appice, Michelangelo Ceci, Antonio Turi, ...
SIGKDD
2000
13 years 4 months ago
The UCI KDD Archive of Large Data Sets for Data Mining Research and Experimentation
Advances in data collection and storage have allowed organizations to create massive, complex and heterogeneous databases, which have stymied traditional methods of data analysis....
Stephen D. Bay, Dennis F. Kibler, Michael J. Pazza...