Sciweavers

2513 search results - page 45 / 503
» Improving Generalization by Data Categorization
Sort
View
KDD
2007
ACM
152views Data Mining» more  KDD 2007»
16 years 7 days ago
Relational data pre-processing techniques for improved securities fraud detection
Commercial datasets are often large, relational, and dynamic. They contain many records of people, places, things, events and their interactions over time. Such datasets are rarel...
Andrew Fast, Lisa Friedland, Marc Maier, Brian Tay...
KDD
2008
ACM
183views Data Mining» more  KDD 2008»
16 years 7 days ago
Knowledge transfer via multiple model local structure mapping
The effectiveness of knowledge transfer using classification algorithms depends on the difference between the distribution that generates the training examples and the one from wh...
Jing Gao, Wei Fan, Jing Jiang, Jiawei Han
IDEAS
2008
IEEE
80views Database» more  IDEAS 2008»
15 years 6 months ago
Improved count suffix trees for natural language data
With more and more natural language text stored in databases, handling respective query predicates becomes very important. Optimizing queries with predicates includes (sub)string ...
Guido Sautter, Cristina Abba, Klemens Böhm
BMCBI
2008
104views more  BMCBI 2008»
14 years 12 months ago
Missing value imputation improves clustering and interpretation of gene expression microarray data
Background: Missing values frequently pose problems in gene expression microarray experiments as they can hinder downstream analysis of the datasets. While several missing value i...
Johannes Tuikkala, Laura Elo, Olli Nevalainen, Ter...
CIVR
2008
Springer
125views Image Analysis» more  CIVR 2008»
15 years 1 months ago
Leveraging user query log: toward improving image data clustering
Image clustering is useful in many retrieval and classification applications. The main goal of image clustering is to partition a given dataset into salient clusters such that the...
Hao Cheng, Kien A. Hua, Khanh Vu