Sciweavers

483 search results - page 14 / 97
» Sampling the Web as Training Data for Text Classification
Sort
View
EMNLP
2010
14 years 9 months ago
Cross Language Text Classification by Model Translation and Semi-Supervised Learning
In this paper, we introduce a method that automatically builds text classifiers in a new language by training on already labeled data in another language. Our method transfers the...
Lei Shi, Rada Mihalcea, Mingjun Tian
NAACL
2007
15 years 1 months ago
Using "Annotator Rationales" to Improve Machine Learning for Text Categorization
We propose a new framework for supervised machine learning. Our goal is to learn from smaller amounts of supervised training data, by collecting a richer kind of training data: an...
Omar Zaidan, Jason Eisner, Christine D. Piatko
ICDM
2005
IEEE
163views Data Mining» more  ICDM 2005»
15 years 5 months ago
Efficient Text Classification by Weighted Proximal SVM
In this paper, we present an algorithm that can classify large-scale text data with high classification quality and fast training speed. Our method is based on a novel extension o...
Dong Zhuang, Benyu Zhang, Qiang Yang, Jun Yan, Zhe...
FLAIRS
2008
15 years 2 months ago
Building Useful Models from Imbalanced Data with Sampling and Boosting
Building useful classification models can be a challenging endeavor, especially when training data is imbalanced. Class imbalance presents a problem when traditional classificatio...
Chris Seiffert, Taghi M. Khoshgoftaar, Jason Van H...
DOCENG
2007
ACM
15 years 3 months ago
Adapting associative classification to text categorization
Associative classification, which originates from numerical data mining, has been applied to deal with text data recently. Text data is firstly digitalized to database of transact...
Baoli Li, Neha Sugandh, Ernest V. Garcia, Ashwin R...