Sciweavers

29 search results - page 4 / 6
» A new suffix tree similarity measure for document clustering
IDEAS
2009
IEEE
14 years 13 days ago
A cluster-based approach to XML similarity joins
A natural consequence of the widespread adoption of XML as a standard for information representation and exchange is the redundant storage of large amounts of persistent XML documen...
Leonardo Ribeiro, Theo Härder, Fernanda S. Pi...
CSMR
2004
IEEE
13 years 9 months ago
The Weighted Combined Algorithm: A Linkage Algorithm for Software Clustering
Software systems need to evolve as business requirements, technology and environment change. As software is modified to accommodate the required changes, its structure deteriorate...
Onaiza Maqbool, Haroon A. Babri
CORIA
2009
13 years 6 months ago
Clustering en recherche d'information : concentration vs distribution de l'information pertinente
Relying on the Cluster Hypothesis, which states that relevant documents tend to be more similar to each other than to non-relevant ones, most information retrieval systems p...
Sylvain Lamprier, Tassadit Amghar, Bernard Levrat,...
SIGMOD
2005
ACM
14 years 6 months ago
Towards Effective Indexing for Very Large Video Sequence Database
With rapid advances in video processing technologies and ever-faster growth in network bandwidth, the popularity of video content publishing and sharing has made similarity sear...
Heng Tao Shen, Beng Chin Ooi, Xiaofang Zhou
GFKL
2005
Springer
13 years 11 months ago
Near Similarity Search and Plagiarism Analysis
Abstract. Existing methods for text plagiarism analysis are mainly based on “chunking”, a process of grouping a text into meaningful units, each of which gets encoded by an integer nu...
Benno Stein, Sven Meyer zu Eissen