Sciweavers

115 search results - page 5 / 23
» Training Data Cleaning for Text Classification
Sort
View
98
Voted
IPM
2008
196views more  IPM 2008»
14 years 9 months ago
Author identification: Using text sampling to handle the class imbalance problem
Authorship analysis of electronic texts assists digital forensics and anti-terror investigation. Author identification can be seen as a single-label multi-class text categorizatio...
Efstathios Stamatatos
98
Voted
IJCAI
2003
14 years 11 months ago
Integrating Background Knowledge Into Text Classification
We present a description of three different algorithms that use background knowledge to improve text classifiers. One uses the background knowledge as an index into the set of tra...
Sarah Zelikovitz, Haym Hirsh
68
Voted
COLING
2002
14 years 9 months ago
Text Categorization using Feature Projections
This paper proposes a new approach for text categorization, based on a feature projection technique. In our approach, training data are represented as the projections of training ...
Youngjoong Ko, Jungyun Seo
EMNLP
2010
14 years 7 months ago
Cross Language Text Classification by Model Translation and Semi-Supervised Learning
In this paper, we introduce a method that automatically builds text classifiers in a new language by training on already labeled data in another language. Our method transfers the...
Lei Shi, Rada Mihalcea, Mingjun Tian
ICDM
2005
IEEE
163views Data Mining» more  ICDM 2005»
15 years 3 months ago
Efficient Text Classification by Weighted Proximal SVM
In this paper, we present an algorithm that can classify large-scale text data with high classification quality and fast training speed. Our method is based on a novel extension o...
Dong Zhuang, Benyu Zhang, Qiang Yang, Jun Yan, Zhe...