Sciweavers

587 search results - page 34 / 118
» New Algorithms for Text Fingerprinting
Sort
View
IPM
2006
146views more  IPM 2006»
14 years 9 months ago
Dictionary-based text categorization of chemical web pages
A new dictionary-based text categorization approach is proposed to classify the chemical web pages efficiently. Using a chemistry dictionary, the approach can extract chemistry-re...
Chunyan Liang, Li Guo, Zhaojie Xia, Feng-Guang Nie...
KDD
2002
ACM
170views Data Mining» more  KDD 2002»
15 years 10 months ago
Enhanced word clustering for hierarchical text classification
In this paper we propose a new information-theoretic divisive algorithm for word clustering applied to text classification. In previous work, such "distributional clustering&...
Inderjit S. Dhillon, Subramanyam Mallela, Rahul Ku...
IJCNLP
2004
Springer
15 years 3 months ago
A Study of Semi-discrete Matrix Decomposition for LSI in Automated Text Categorization
Abstract. This paper proposes the use of Latent Semantic Indexing (LSI) techniques, decomposed with semi-discrete matrix decomposition (SDD) method, for text categorization. The SD...
Qiang Wang, Xiaolong Wang, Guan Yi
KDD
2007
ACM
176views Data Mining» more  KDD 2007»
15 years 10 months ago
Mining correlated bursty topic patterns from coordinated text streams
Previous work on text mining has almost exclusively focused on a single stream. However, we often have available multiple text streams indexed by the same set of time points (call...
Xuanhui Wang, ChengXiang Zhai, Xiao Hu, Richard Sp...
CIKM
2009
Springer
15 years 4 months ago
Text segmentation via topic modeling: an analytical study
In this paper, the task of text segmentation is approached from a topic modeling perspective. We investigate the use of latent Dirichlet allocation (LDA) topic model to segment a ...
Hemant Misra, François Yvon, Joemon M. Jose...