Sciweavers

19 search results - page 1 / 4
» Efficient Phrase-Based Document Similarity for Clustering
TKDE
2008
13 years 4 months ago
Efficient Phrase-Based Document Similarity for Clustering
Phrases have been considered more informative feature terms for improving the effectiveness of document clustering. In this paper, we propose a phrase-based document similarity t...
Hung Chim, Xiaotie Deng
WWW
2008
ACM
13 years 4 months ago
A Novelty-based Clustering Method for On-line Documents
In this paper, we describe a document clustering method called novelty-based document clustering. This method clusters documents based on similarity and novelty. The method assigns...
Sophoin Khy, Yoshiharu Ishikawa, Hiroyuki Kitagawa
CIKM
2008
Springer
13 years 6 months ago
Peer-to-peer similarity search over widely distributed document collections
This paper addresses the challenging problem of similarity search over widely distributed ultra-high dimensional data. One such application is the retrieval of the top-k most similar d...
Christos Doulkeridis, Kjetil Nørvåg, ...
CORR
2006
Springer
13 years 4 months ago
A tool set for the quick and efficient exploration of large document collections
We are presenting a set of multilingual text analysis tools that can help analysts in any field to explore large document collections quickly in order to determine whether the do...
Camelia Ignat, Bruno Pouliquen, Ralf Steinberger, ...
IPM
2006
13 years 4 months ago
Combining preference- and content-based approaches for improving document clustering effectiveness
E-commerce and knowledge management applications generate and consume tremendous amounts of online information that is typically available as textual documents. To facilitate subs...
Chih-Ping Wei, Chin-Sheng Yang, Han-Wei Hsiao, Tsa...