Sciweavers

57 search results - page 1 / 12
» Evaluation of Text Clustering Algorithms with N-Gram-Based D...
Sort
View
72
Voted
ECIR
2009
Springer
15 years 7 months ago
Evaluation of Text Clustering Algorithms with N-Gram-Based Document Fingerprints
This paper presents a new approach designed to reduce the computational load of the existing clustering algorithms by trimming down the documents size using fingerprinting methods...
Javier Parapar, Alvaro Barreiro
73
Voted
CIKM
2008
Springer
15 years 7 days ago
Winnowing-based text clustering
We present an approach to document clustering based on winnowing fingerprints that achieved good values of effectiveness with considerable save in memory space and computation tim...
Javier Parapar, Alvaro Barreiro
186
Voted
JCDL
2011
ACM
374views Education» more  JCDL 2011»
14 years 1 months ago
Comparative evaluation of text- and citation-based plagiarism detection approaches using guttenplag
Various approaches for plagiarism detection exist. All are based on more or less sophisticated text analysis methods such as string matching, fingerprinting or style comparison. I...
Bela Gipp, Norman Meuschke, Jöran Beel
87
Voted
CICLING
2008
Springer
15 years 7 days ago
Evaluation of Internal Validity Measures in Short-Text Corpora
Short texts clustering is one of the most difficult tasks in natural language processing due to the low frequencies of the document terms. We are interested in analysing these kind...
Diego Ingaramo, David Pinto, Paolo Rosso, Marcelo ...
CIKM
2006
Springer
15 years 2 months ago
Incremental hierarchical clustering of text documents
Incremental hierarchical text document clustering algorithms are important in organizing documents generated from streaming on-line sources, such as, Newswire and Blogs. However, ...
Nachiketa Sahoo, Jamie Callan, Ramayya Krishnan, G...