Sciweavers

CORR
2010
Springer
13 years 4 months ago
Text Classification using the Concept of Association Rule of Data Mining
As the amount of online text increases, the demand for text classification to aid the analysis and management of text is increasing. Text is cheap, but information, in the form of...
Chowdhury Mofizur Rahman, Ferdous Ahmed Sohel, Par...
SIGIR
2010
ACM
13 years 4 months ago
SED: supervised experimental design and its application to text classification
In recent years, active learning methods based on experimental design have achieved state-of-the-art performance in text classification applications. Although these methods can exploit ...
Yi Zhen, Dit-Yan Yeung
PRIS
2004
13 years 5 months ago
Effect of Feature Smoothing Methods in Text Classification Tasks
Abstract. The number of features to be considered in a text classification system is given by the size of the vocabulary and this is normally in the range of the tens or hundreds o...
David Vilar, Hermann Ney, Alfons Juan, Enrique Vid...
CLIN
2001
13 years 5 months ago
Accurate Stemming of Dutch for Text Classification
This paper investigates the use of stemming for classification of Dutch (email) texts. We introduce a stemmer, which combines dictionary lookup (implemented efficiently as a finit...
Tanja Gaustad, Gosse Bouma
CASCON
2001
13 years 5 months ago
Email classification with co-training
The main problems in text classification are lack of labeled data, as well as the cost of labeling the unlabeled data. We address these problems by exploring co-training - an algo...
Svetlana Kiritchenko, Stan Matwin
FLAIRS
2006
13 years 5 months ago
Using Web Searches on Important Words to Create Background Sets for LSI Classification
The world wide web has a wealth of information that is related to almost any text classification task. This paper presents a method for mining the web to improve text classificati...
Sarah Zelikovitz, Marina Kogan
FLAIRS
2004
13 years 5 months ago
Automatic Generation of Background Text to Aid Classification
We illustrate that Web searches can often be utilized to generate background text for use with text classification. This is the case because there are frequently many pages on the...
Sarah Zelikovitz, Robert Hafner
DMIN
2006
13 years 5 months ago
Towards Using Fewer Features for Text Classification
Abstract. Text classification or categorization is a conventional classification problem applied to the text domain. In the cases when statistical classification methods are used,...
Yuan Yuan, Tianyang Gu
ICMLA
2008
13 years 6 months ago
Text Classification Using Tree Kernels and Linguistic Information
Standard Machine Learning approaches to text classification use the bag-of-words representation of documents to derive the classification target function. Typical linguistic stru...
Teresa Gonçalves, Paulo Quaresma
ECIR
2008
Springer
13 years 6 months ago
Semi-supervised Document Classification with a Mislabeling Error Model
Abstract. This paper investigates a new extension of the Probabilistic Latent Semantic Analysis (PLSA) model [6] for text classification where the training set is partially labeled...
Anastasia Krithara, Massih-Reza Amini, Jean-Michel...