Sciweavers

36 search results - page 1 / 8
» Measuring Structural Similarity Among Web Documents: Prelimi...
Sort
View
EP
1998
Springer
13 years 8 months ago
Measuring Structural Similarity Among Web Documents: Preliminary Results
When we describe a Web page informally, we often use phrases like it looks like a newspaper site", there are several unordered lists" or it's just a collection of li...
Isabel F. Cruz, Slava Borisov, Michael A. Marks, T...
WEBI
2007
Springer
13 years 10 months ago
Extending Link-based Algorithms for Similar Web Pages with Neighborhood Structure
The problem of finding similar pages to a given web page arises in many web applications such as search engine. In this paper, we focus on the link-based similarity measures whic...
Zhenjiang Lin, Michael R. Lyu, Irwin King
WWW
2007
ACM
14 years 5 months ago
A new suffix tree similarity measure for document clustering
In this paper, we propose a new similarity measure to compute the pairwise similarity of text-based documents based on suffix tree document model. By applying the new suffix tree ...
Hung Chim, Xiaotie Deng
PODS
2008
ACM
211views Database» more  PODS 2008»
14 years 4 months ago
The power of two min-hashes for similarity search among hierarchical data objects
In this study we propose sketching algorithms for computing similarities between hierarchical data. Specifically, we look at data objects that are represented using leaf-labeled t...
Sreenivas Gollapudi, Rina Panigrahy
AUSDM
2008
Springer
230views Data Mining» more  AUSDM 2008»
13 years 6 months ago
Combining Structure and Content Similarities for XML Document Clustering
This paper proposes a clustering approach that explores both the content and the structure of XML documents for determining similarity among them. Assuming that the content and th...
Tien Tran, Richi Nayak, Peter Bruza