Sciweavers

947 search results - page 152 / 190
» Evaluation of Sampling for Data Mining of Association Rules
AAAI
2007
15 years 2 months ago
Relation Extraction from Wikipedia Using Subtree Mining
The exponential growth and reliability of Wikipedia have made it a promising data source for intelligent systems. The first challenge of Wikipedia is to make the encyclopedia mac...
Dat P. T. Nguyen, Yutaka Matsuo, Mitsuru Ishizuka
KDD
2008
ACM
16 years 28 days ago
Identifying biologically relevant genes via multiple heterogeneous data sources
Selection of genes that are differentially expressed and critical to a particular biological process has been a major challenge in post-array analysis. Recent development in bioin...
Zheng Zhao, Jiangxin Wang, Huan Liu, Jieping Ye, Y...
BMCBI
2010
15 years 19 days ago
Comparison of scores for bimodality of gene expression distributions and genome-wide evaluation of the prognostic relevance of h
Background: A major goal of the analysis of high-dimensional RNA expression data from tumor tissue is to identify prognostic signatures for discriminating patient subgroups. For t...
Birte Hellwig, Jan G. Hengstler, Marcus Schmidt, M...
ICDAR
2009
IEEE
15 years 7 months ago
Scalable Feature Extraction from Noisy Documents
We cope with the metadata recognition in layout-oriented documents. We address the problem as a classification task and propose a method for automatic extraction of relevant featu...
Loïc Lecerf, Boris Chidlovskii
SIGMOD
2001
ACM
16 years 20 days ago
Data Bubbles: Quality Preserving Performance Boosting for Hierarchical Clustering
In this paper, we investigate how to scale hierarchical clustering methods (such as OPTICS) to extremely large databases by utilizing data compression methods (such as BIRCH or ra...
Markus M. Breunig, Hans-Peter Kriegel, Peer Krö...