Sciweavers

161 search results - page 16 / 33
» Improving Similarity Measures for Short Segments of Text
Sort
View
ICML
1997
IEEE
15 years 1 months ago
A Comparative Study on Feature Selection in Text Categorization
This paper is a comparative study of feature selection methods in statistical learning of text categorization. The focus is on aggressive dimensionality reduction. Five methods we...
Yiming Yang, Jan O. Pedersen
98
Voted
SIGIR
2008
ACM
14 years 9 months ago
Enhancing text clustering by leveraging Wikipedia semantics
Most traditional text clustering methods are based on "bag of words" (BOW) representation based on frequency statistics in a set of documents. BOW, however, ignores the ...
Jian Hu, Lujun Fang, Yang Cao, Hua-Jun Zeng, Hua L...
ICB
2007
Springer
120views Biometrics» more  ICB 2007»
15 years 1 months ago
Online Text-Independent Writer Identification Based on Stroke's Probability Distribution Function
Abstract. This paper introduces a novel method for online writer identification. Traditional methods make use of the distribution of directions in handwritten traces. The novelty o...
Bangyu Li, Zhenan Sun, Tieniu Tan
96
Voted
ITCC
2003
IEEE
15 years 2 months ago
A Method for Calculating Term Similarity on Large Document Collections
We present an efficient algorithm called the Quadtree Heuristic for identifying a list of similar terms for each unique term in a large document collection. Term similarity is de...
Wolfgang W. Bein, Jeffrey S. Coombs, Kazem Taghva
117
Voted
HICSS
2003
IEEE
193views Biometrics» more  HICSS 2003»
15 years 2 months ago
Message Sense Maker: Engineering a Tool Set for Customer Relationship Management
To determine the important trends and issues in thousands of comments from customers and make strategic decisions about business operations, managers must go over these messages m...
Dmitri Roussinov, J. Leon Zhao