Sciweavers

268 search results - page 5 / 54
» Exploiting Category Information and Document Information to ...
Sort
View
CIKM
2006
Springer
13 years 9 months ago
Multi-task text segmentation and alignment based on weighted mutual information
Text segmentation is important for text analysis, while text alignment is to determine shared sub-topics among similar documents. Multi-task text segmentation and alignment is the...
Bingjun Sun, Ding Zhou, Hongyuan Zha, John Yen
MM
2006
ACM
166views Multimedia» more  MM 2006»
13 years 11 months ago
Automatic document orientation detection and categorization through document vectorization
This paper presents an automatic orientation detection and categorization technique that is capable of detecting the orientation of multilingual documents with arbitrary skew and ...
Shijian Lu, Chew Lim Tan
KDD
2009
ACM
243views Data Mining» more  KDD 2009»
14 years 6 months ago
Exploiting Wikipedia as external knowledge for document clustering
In traditional text clustering methods, documents are represented as "bags of words" without considering the semantic information of each document. For instance, if two ...
Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, ...
ISCI
2006
136views more  ISCI 2006»
13 years 5 months ago
Computing with words for text processing: An approach to the text categorization
The use of the computing with words paradigm for the automatic text documents categorization problem is discussed. This specific problem of information retrieval (IR) becomes more...
Slawomir Zadrozny, Janusz Kacprzyk
SAC
2008
ACM
13 years 5 months ago
Discovering relationships among categories using misclassification information
Knowledge of relationships among categories is of the interest in different domains such as text classification, content analysis, and text mining. We propose and evaluate approac...
Saket S. R. Mengle, Nazli Goharian, Alana Platt