Sciweavers

140 search results - page 3 / 28
» PaDDMAS: Parallel and Distributed Data Mining Application Su...
PPOPP
2005
ACM
13 years 11 months ago
A sampling-based framework for parallel data mining
The goal of data mining algorithms is to discover useful information embedded in large databases. Frequent itemset mining and sequential pattern mining are two important data minin...
Shengnan Cong, Jiawei Han, Jay Hoeflinger, David A...
HPDC
2006
IEEE
13 years 11 months ago
Troubleshooting Distributed Systems via Data Mining
Through massive parallelism, distributed systems enable the multiplication of productivity. Unfortunately, increasing the scale of machines available to users will also multiply d...
David A. Cieslak, Douglas Thain, Nitesh V. Chawla
IPPS
1998
IEEE
13 years 10 months ago
High Performance Data Mining Using Data Cubes on Parallel Computers
On-Line Analytical Processing techniques are used for data analysis and decision support systems. The multidimensionality of the underlying data is well represented by multidimens...
Sanjay Goil, Alok N. Choudhary
IPPS
1998
IEEE
13 years 10 months ago
Hiding Communication Latency in Data Parallel Applications
Interprocessor communication times can be a significant fraction of the overall execution time required for data parallel applications. Large communication to computation ratios o...
Vivek Garg, David E. Schimmel
IPPS
2010
IEEE
13 years 3 months ago
Improving MapReduce performance through data placement in heterogeneous Hadoop clusters
MapReduce has become an important distributed processing model for large-scale data-intensive applications like data mining and web indexing. Hadoop
Jiong Xie, Shu Yin, Xiaojun Ruan, Zhiyang Ding, Yu...