Sciweavers

» Feature selection methods for text classification
CIKM
2010
Springer
14 years 8 months ago
Regularization and feature selection for networked features
In the standard formalization of supervised learning problems, a datum is represented as a vector of features without prior knowledge about relationships among features. However, ...
Hongliang Fei, Brian Quanz, Jun Huan
SMC
2007
IEEE
Control Systems
15 years 5 months ago
Text categorization based on the ratio of word frequency in each categories
In the present paper, we consider automatic text categorization as a series of information processing and propose a new classification technique called the Frequency Ratio ...
Makoto Suzuki, Shigeichi Hirasawa
KDD
2002
ACM
Data Mining
15 years 11 months ago
Combining clustering and co-training to enhance text classification using unlabelled data
In this paper, we present a new co-training strategy that makes use of unlabelled data. It trains two predictors in parallel, with each predictor labelling the unlabelled data for...
Bhavani Raskutti, Herman L. Ferrá, Adam Kow...
AUSAI
2007
Springer
15 years 2 months ago
Effectiveness of Methods for Syntactic and Semantic Recognition of Numeral Strings: Tradeoffs Between Number of Features and Len
This paper describes and compares the use of methods based on Ngrams (specifically trigrams and pentagrams), together with five features, to recognise the syntactic and s...
Kyongho Min, William H. Wilson, Byeong Ho Kang
KDD
2004
ACM
Data Mining
15 years 11 months ago
Redundancy based feature selection for microarray data
In gene expression microarray data analysis, selecting a small number of discriminative genes from thousands of genes is an important problem for accurate classification of diseas...
Lei Yu, Huan Liu