Sciweavers

» An information-theoretic measure for document similarity
VLDB
2007
ACM
16 years 2 months ago
Measuring the Structural Similarity of Semistructured Documents Using Entropy
We propose a technique for measuring the structural similarity of semistructured documents based on entropy. After extracting the structural information from two documents we use ...
Sven Helmer
EP
1998
Springer
15 years 6 months ago
Measuring Structural Similarity Among Web Documents: Preliminary Results
When we describe a Web page informally, we often use phrases like "it looks like a newspaper site", "there are several unordered lists" or "it's just a collection of li...
Isabel F. Cruz, Slava Borisov, Michael A. Marks, T...
JDWM
2008
15 years 1 months ago
Medical Document Clustering Using Ontology-Based Term Similarity Measures
Xiaodan Zhang, Liping Jing, Xiaohua Hu, Michael K....
ICDAR
2003
IEEE
15 years 7 months ago
Document page similarity based on layout visual saliency: Application to query by example and document classification
In this paper we propose to define a measure of visual similarity to compare different pages in a corpus. This measure is based on the analysis of the visual layout saliency of th...
Véronique Eglin, Stéphane Bres
IAT
2005
IEEE
15 years 7 months ago
Category-based Similarity Algorithm for Semantic Similarity in Multi-agent Information Sharing Systems
Similarity measures are mechanisms that assign a numeric score indicating how closely two documents, or a document and a query, match. The Cosine measure is one of the similarity m...
Sepideh Miralaei, Ali A. Ghorbani