Sciweavers

34 search results - page 3 / 7
» Scalable Term Selection for Text Categorization
Sort
View
SAC
2008
ACM
13 years 4 months ago
An efficient feature ranking measure for text categorization
A major obstacle that decreases the performance of text classifiers is the extremely high dimensionality of text data. To reduce the dimension, a number of approaches based on rou...
Songbo Tan, Yuefen Wang, Xueqi Cheng
WWW
2005
ACM
14 years 5 months ago
An experimental study on large-scale web categorization
Taxonomies of the Web typically have hundreds of thousands of categories and skewed category distribution over documents. It is not clear whether existing text classification tech...
Tie-Yan Liu, Yiming Yang, Hao Wan, Qian Zhou, Bin ...
SDM
2009
SIAM
140views Data Mining» more  SDM 2009»
14 years 1 months ago
Straightforward Feature Selection for Scalable Latent Semantic Indexing.
Latent Semantic Indexing (LSI) has been validated to be effective on many small scale text collections. However, little evidence has shown its effectiveness on unsampled large sca...
Jun Yan, Shuicheng Yan, Ning Liu, Zheng Chen
TKDE
2008
111views more  TKDE 2008»
13 years 4 months ago
Text Clustering with Feature Selection by Using Statistical Data
Abstract-- Feature selection is an important method for improving the efficiency and accuracy of text categorization algorithms by removing redundant and irrelevant terms from the ...
Yanjun Li, Congnan Luo, Soon M. Chung
ICML
2003
IEEE
13 years 10 months ago
An Evaluation on Feature Selection for Text Clustering
Feature selection methods have been successfully applied to text categorization but seldom applied to text clustering due to the unavailability of class label information. In this...
Tao Liu, Shengping Liu, Zheng Chen, Wei-Ying Ma