Sciweavers

298 search results - page 13 / 60
» An information-theoretic measure for document similarity
IAJIT
2011
14 years 3 months ago
A hierarchical K-NN classifier for textual data
This paper presents a classifier that is based on a modified version of the well-known K-Nearest Neighbors classifier (K-NN). The original K-NN classifier was adjusted to work wi...
Rehab M. Duwairi, Rania Al-Zubaidi
DMIN
2006
293 views, Data Mining
15 years 1 months ago
Arabic Text Classification Using N-Gram Frequency Statistics: A Comparative Study
This paper presents the results of classifying Arabic text documents using the N-gram frequency statistics technique employing a dissimilarity measure called the "Manhattan di...
Laila Khreisat
DEXAW
1999
IEEE
106 views, Database
15 years 4 months ago
Textual Similarities Based on a Distributional Approach
The design of efficient textual similarities is an important issue in the domain of textual data exploration. Textual similarities are, for example, central in document collection s...
Romaric Besançon, Martin Rajman, Jean-Cé...
SIGIR
2002
ACM
14 years 11 months ago
Document clustering with committees
Document clustering is useful in many information retrieval tasks: document browsing, organization and viewing of retrieval results, generation of Yahoo-like hierarchies of docume...
Patrick Pantel, Dekang Lin
JUCS
2011
161 views
14 years 6 months ago
Document Retrieval Using SIFT Image Features
This paper describes a new approach to document classification based on visual features alone. Text-based retrieval systems perform poorly on noisy text. We have conducted serie...
Dan Smith, Richard Harvey