Sciweavers

711 search results - page 97 / 143
» Applying Support Vector Machines to Imbalanced Datasets
Sort
View
KDD
2003
ACM
214views Data Mining» more  KDD 2003»
16 years 6 days ago
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney
CIBCB
2006
IEEE
15 years 5 months ago
A Model-Free Greedy Gene Selection for Microarray Sample Class Prediction
— Microarray data analysis is notoriously challenging as it involves a huge number of genes compared to only a limited number of samples. Gene selection, to detect the most signi...
Yi Shi, Zhipeng Cai, Lizhe Xu, Wei Ren, Randy Goeb...
SGAI
2004
Springer
15 years 5 months ago
Neighbourhood Exploitation in Hypertext Categorization
As the web expands exponentially, the need to put some order to its content becomes apparent. Hypertext categorization, that is the automatic classification of web documents into ...
Houda Benbrahim, Max Bramer
CBMS
2009
IEEE
15 years 3 months ago
A medical image retrieval framework in correlation enhanced visual concept feature space
This paper presents a medical image retrieval framework that uses visual concepts in a feature space employing statistical models built using a probabilistic multi-class support v...
Md. Mahmudur Rahman, Sameer Antani, George R. Thom...
CIKM
2008
Springer
15 years 1 months ago
The role of syntactic features in protein interaction extraction
Most approaches for protein interaction mining from biomedical texts use both lexical and syntactic features. However, the individual impact of these two kinds of features on the ...
Timur Fayruzov, Martine De Cock, Chris Cornelis, V...