Sciweavers

115 search results - page 4 / 23
» Training Data Cleaning for Text Classification
Sort
View
ECIR
2003
Springer
14 years 11 months ago
A Machine Learning Approach for the Curation of Biomedical Literature
In this paper, we present an automated text classification system for the classification of biomedical papers. This classification is based on whether there is experimental eviden...
Min Shi, David S. Edwin, Rakesh Menon, Lixiang She...
79
Voted
ICDM
2008
IEEE
164views Data Mining» more  ICDM 2008»
15 years 4 months ago
Classifying High-Dimensional Text and Web Data Using Very Short Patterns
In this paper, we propose the "Democratic Classifier", a simple, democracy-inspired patternbased classification algorithm that uses very short patterns for classificatio...
Hassan H. Malik, John R. Kender
AAAI
2006
14 years 11 months ago
Multi-Conditional Learning: Generative/Discriminative Training for Clustering and Classification
This paper presents multi-conditional learning (MCL), a training criterion based on a product of multiple conditional likelihoods. When combining the traditional conditional proba...
Andrew McCallum, Chris Pal, Gregory Druck, Xuerui ...
KDD
2003
ACM
214views Data Mining» more  KDD 2003»
15 years 10 months ago
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney
TCBB
2010
163views more  TCBB 2010»
14 years 4 months ago
Classification of Protein-Protein Interaction Full-Text Documents Using Text and Citation Network Features
Abstract--We participated (as Team 9) in the Article Classification Task of the Biocreative II.5 Challenge: binary classification of fulltext documents relevant for protein-protein...
Artemy Kolchinsky, Alaa Abi-Haidar, Jasleen Kaur, ...