Sciweavers

385 search results - page 29 / 77
» Improving data mining utility with projective sampling
ADMA
2006
Springer
131 views, Data Mining
15 years 3 months ago
Distance Guided Classification with Gene Expression Programming
Gene Expression Programming (GEP) aims at discovering essential rules hidden in observed data and expressing them mathematically. GEP has been proved to be a powerful tool for cons...
Lei Duan, Changjie Tang, Tianqing Zhang, Dagang We...
KDD
2006
ACM
222 views, Data Mining
15 years 10 months ago
A component-based framework for knowledge discovery in bioinformatics
Motivation: In the field of bioinformatics there is an emerging need to integrate all knowledge discovery steps into a standardized modular framework. Indeed, component-based deve...
Julien Etienne, Bernd Wachmann, Lei Zhang
ICDE
2005
IEEE
118 views, Database
15 years 11 months ago
A Framework for High-Accuracy Privacy-Preserving Mining
To preserve client privacy in the data mining process, a variety of techniques based on random perturbation of individual data records have been proposed recently. In this paper, ...
Shipra Agrawal, Jayant R. Haritsa
SIGMOD
2001
ACM
200 views, Database
15 years 9 months ago
Data Bubbles: Quality Preserving Performance Boosting for Hierarchical Clustering
In this paper, we investigate how to scale hierarchical clustering methods (such as OPTICS) to extremely large databases by utilizing data compression methods (such as BIRCH or ra...
Markus M. Breunig, Hans-Peter Kriegel, Peer Krö...
BMCBI
2010
133 views
14 years 9 months ago
Improving de novo sequence assembly using machine learning and comparative genomics for overlap correction
Background: With the rapid expansion of DNA sequencing databases, it is now feasible to identify relevant information from prior sequencing projects and completed genomes and appl...
Lance E. Palmer, Mathäus Dejori, Randall A. B...