Sciweavers

» Optimization in Data Mining
KDD
2009
ACM
Pervasive parallelism in data mining: dataflow solution to co-clustering large and sparse Netflix data
All Netflix Prize algorithms proposed so far are prohibitively costly for large-scale production systems. In this paper, we describe an efficient dataflow implementation of a coll...
Srivatsava Daruru, Nena M. Marin, Matt Walker, Joy...
KDD
2005
ACM
Automated detection of frontal systems from numerical model-generated data
Xiang Li, Rahul Ramachandran, Sara J. Graves, Suni...
DMKD
2004
ACM
Discovery of ads web hosts through traffic data analysis
One of the most topical problems in web crawling...
V. Bacarella, Fosca Giannotti, Mirco Nanni, Dino P...
TKDE
2012
Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
Preparing a data set for analysis is generally the most time-consuming task in a data mining project, requiring many complex SQL queries, joining tables and aggregating columns....
Carlos Ordonez, Zhibo Chen 0002
SIGMOD
1998
ACM
Automatic Subspace Clustering of High Dimensional Data for Data Mining Applications
Data mining applications place special requirements on clustering algorithms including: the ability to find clusters embedded in subspaces of high dimensional data, scalability, end...
Rakesh Agrawal, Johannes Gehrke, Dimitrios Gunopul...