Sciweavers

42 search results - page 2 / 9
» A sampling-based framework for parallel data mining
Sort
View
TKDE
2012
278views Formal Methods» more  TKDE 2012»
11 years 7 months ago
Data Cube Materialization and Mining over MapReduce
—Computing interesting measures for data cubes and subsequent mining of interesting cube groups over massive datasets are critical for many important analyses done in the real wo...
Arnab Nandi, Cong Yu, Philip Bohannon, Raghu Ramak...
HPDC
2006
IEEE
13 years 10 months ago
Troubleshooting Distributed Systems via Data Mining
Through massive parallelism, distributed systems enable the multiplication of productivity. Unfortunately, increasing the scale of available machines to users will also multiply d...
David A. Cieslak, Douglas Thain, Nitesh V. Chawla
SDM
2012
SIAM
237views Data Mining» more  SDM 2012»
11 years 7 months ago
A Distributed Kernel Summation Framework for General-Dimension Machine Learning
Kernel summations are a ubiquitous key computational bottleneck in many data analysis methods. In this paper, we attempt to marry, for the first time, the best relevant technique...
Dongryeol Lee, Richard W. Vuduc, Alexander G. Gray
KDD
1997
ACM
111views Data Mining» more  KDD 1997»
13 years 8 months ago
SIPping from the Data Firehose
When mining large databases, the data extraction problem and the interface between the database and data mining algorithm become important issues. Rather than giving a mining algo...
George H. John, Brian Lent
IDEAS
1999
IEEE
175views Database» more  IDEAS 1999»
13 years 9 months ago
A Parallel Scalable Infrastructure for OLAP and Data Mining
Decision support systems are important in leveraging information present in data warehouses in businesses like banking, insurance, retail and health-care among many others. The mu...
Sanjay Goil, Alok N. Choudhary