Sciweavers

298 search results - page 31 / 60
» An information-theoretic measure for document similarity
Sort
View
CIKM
2004
Springer
15 years 5 months ago
Soft clustering criterion functions for partitional document clustering: a summary of results
Recently published studies have shown that partitional clustering algorithms that optimize certain criterion functions, which measure key aspects of inter- and intra-cluster simil...
Ying Zhao, George Karypis
ICDAR
2011
IEEE
13 years 11 months ago
Non-rigid Registration and Restoration of Double-Sided Historical Manuscripts
This paper presents a fully automatic framework for the restoration of double-sided historical manuscripts which are impaired by ink bleed-through distortions. First, the recto si...
Jie Wang, Chew Lim Tan
ACL
2010
14 years 9 months ago
Automatic Evaluation of Linguistic Quality in Multi-Document Summarization
To date, few attempts have been made to develop and validate methods for automatic evaluation of linguistic quality in text summarization. We present the first systematic assessme...
Emily Pitler, Annie Louis, Ani Nenkova
KDD
2009
ACM
243views Data Mining» more  KDD 2009»
16 years 10 days ago
Exploiting Wikipedia as external knowledge for document clustering
In traditional text clustering methods, documents are represented as "bags of words" without considering the semantic information of each document. For instance, if two ...
Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, ...
VLDB
2003
ACM
125views Database» more  VLDB 2003»
16 years 1 days ago
THESUS: Organizing Web document collections based on link semantics
Abstract. The requirements for effective search and management of the WWW are stronger than ever. Currently Web documents are classified based on their content not taking into acco...
Maria Halkidi, Benjamin Nguyen, Iraklis Varlamis, ...