Sciweavers

361 search results - page 10 / 73
» Distributed multi-relational data mining based on genetic al...
Sort
View
IPPS
2000
IEEE
15 years 4 months ago
A Requirements Analysis for Parallel KDD Systems
Abstract. The current generation of data mining tools have limited capacity and performance, since these tools tend to be sequential. This paper explores a migration path out of th...
William Maniatty, Mohammed Javeed Zaki
EDBT
2004
ACM
234views Database» more  EDBT 2004»
15 years 12 months ago
A Condensation Approach to Privacy Preserving Data Mining
In recent years, privacy preserving data mining has become an important problem because of the large amount of personal data which is tracked by many business applications. In many...
Charu C. Aggarwal, Philip S. Yu
80
Voted
ICDM
2006
IEEE
130views Data Mining» more  ICDM 2006»
15 years 5 months ago
Boosting for Learning Multiple Classes with Imbalanced Class Distribution
Classification of data with imbalanced class distribution has posed a significant drawback of the performance attainable by most standard classifier learning algorithms, which ...
Yanmin Sun, Mohamed S. Kamel, Yang Wang 0007
BMCBI
2011
14 years 3 months ago
A hierarchical Bayesian network approach for linkage disequilibrium modeling and data-dimensionality reduction prior to genome-w
Background: Discovering the genetic basis of common genetic diseases in the human genome represents a public health issue. However, the dimensionality of the genetic data (up to 1...
Raphael Mourad, Christine Sinoquet, Philippe Leray
JDCTA
2010
464views more  JDCTA 2010»
14 years 6 months ago
A New Agglomerative Hierarchical Clustering Algorithm Implementation based on the Map Reduce Framework
Text clustering is one of the difficult and hot research fields in the text mining research. Combing Map Reduce framework and the neuron initialization method of VPSOM (vector pre...
Hui Gao, Jun Jiang, Li She, Yan Fu