Sciweavers

2936 search results for "Genetic Process Mining" - page 331 / 588
IPPS
2010
IEEE
15 years 2 months ago
Improving MapReduce performance through data placement in heterogeneous Hadoop clusters
MapReduce has become an important distributed processing model for large-scale data-intensive applications like data mining and web indexing. Hadoop
Jiong Xie, Shu Yin, Xiaojun Ruan, Zhiyang Ding, Yu...
BMCBI
2008
141 views
15 years 4 months ago
Ontology-guided data preparation for discovering genotype-phenotype relationships
Complexity of post-genomic data and multiplicity of mining strategies are two limits to Knowledge Discovery in Databases (KDD) in life sciences. Because they provide a semantic fr...
Adrien Coulet, Malika Smaïl-Tabbone, Pascale ...
KDD
2006
ACM
117 views, Data Mining
16 years 5 months ago
Efficient multidimensional data representations based on multiple correspondence analysis
In the On Line Analytical Processing (OLAP) context, exploration of huge and sparse data cubes is a tedious task which does not always lead to efficient results. In this paper, we...
Omar Boussaid, Riadh Ben Messaoud, Sabine Loudcher...
KDD
2005
ACM
122 views, Data Mining
16 years 5 months ago
Pattern lattice traversal by selective jumps
Regardless of the frequent patterns to discover, either the full frequent patterns or the condensed ones, either closed or maximal, the strategy always includes the traversal of t...
Osmar R. Zaïane, Mohammad El-Hajj
PAKDD
2009
ACM
151 views, Data Mining
15 years 11 months ago
Budget Semi-supervised Learning
In this paper we propose to study budget semi-supervised learning, i.e., semi-supervised learning with a resource budget, such as a limited memory insufficient to accommodate and/...
Zhi-Hua Zhou, Michael Ng, Qiao-Qiao She, Yuan Jian...