Sciweavers

2135 search results - page 165 / 427
» Database Paper - The IRI Marketing Data Set
Sort
View
KDD
2009
ACM
239views Data Mining» more  KDD 2009»
16 years 6 months ago
Tell me something I don't know: randomization strategies for iterative data mining
There is a wide variety of data mining methods available, and it is generally useful in exploratory data analysis to use many different methods for the same dataset. This, however...
Heikki Mannila, Kai Puolamäki, Markus Ojala, ...
285
Voted
ICDE
2006
IEEE
164views Database» more  ICDE 2006»
16 years 7 months ago
New Sampling-Based Estimators for OLAP Queries
One important way in which sampling for approximate query processing in a database environment differs from traditional applications of sampling is that in a database, it is feasi...
Ruoming Jin, Leonid Glimcher, Chris Jermaine, Gaga...
KDD
2006
ACM
164views Data Mining» more  KDD 2006»
16 years 6 months ago
Assessing data mining results via swap randomization
The problem of assessing the significance of data mining results on high-dimensional 0?1 data sets has been studied extensively in the literature. For problems such as mining freq...
Aristides Gionis, Heikki Mannila, Panayiotis Tsapa...
KDD
2003
ACM
142views Data Mining» more  KDD 2003»
16 years 6 months ago
Mining phenotypes and informative genes from gene expression data
Mining microarray gene expression data is an important research topic in bioinformatics with broad applications. While most of the previous studies focus on clustering either gene...
Chun Tang, Aidong Zhang, Jian Pei
160
Voted
DMIN
2008
152views Data Mining» more  DMIN 2008»
15 years 7 months ago
PCS: An Efficient Clustering Method for High-Dimensional Data
Clustering algorithms play an important role in data analysis and information retrieval. How to obtain a clustering for a large set of highdimensional data suitable for database ap...
Wei Li 0011, Cindy Chen, Jie Wang