Sciweavers

86 search results - page 2 / 18
» Measuring similarity of semi-structured documents with conte...
Sort
View
CIKM
2001
Springer
13 years 9 months ago
Query-Sensitive Similarity Measures for the Calculation of Interdocument Relationships
The application of document clustering to information retrieval has been motivated by the potential effectiveness gains postulated by the Cluster Hypothesis. The hypothesis states ...
Anastasios Tombros, C. J. van Rijsbergen
ICDM
2002
IEEE
162views Data Mining» more  ICDM 2002»
13 years 9 months ago
Phrase-based Document Similarity Based on an Index Graph Model
Document clustering techniques mostly rely on single term analysis of the document data set, such as the Vector Space Model. To better capture the structure of documents, the unde...
Khaled M. Hammouda, Mohamed S. Kamel
PODS
2008
ACM
211views Database» more  PODS 2008»
14 years 5 months ago
The power of two min-hashes for similarity search among hierarchical data objects
In this study we propose sketching algorithms for computing similarities between hierarchical data. Specifically, we look at data objects that are represented using leaf-labeled t...
Sreenivas Gollapudi, Rina Panigrahy
ICDAR
2003
IEEE
13 years 10 months ago
Optimizing Binary Feature Vector Similarity Measure using Genetic Algorithm and Handwritten Character Recognition
Classifying an unknown input is a fundamental problem in pattern recognition. A common method is to define a distance metric between patterns and find the most similar pattern i...
Sung-Hyuk Cha, Charles C. Tappert, Sargur N. Sriha...
AUSDM
2008
Springer
230views Data Mining» more  AUSDM 2008»
13 years 6 months ago
Combining Structure and Content Similarities for XML Document Clustering
This paper proposes a clustering approach that explores both the content and the structure of XML documents for determining similarity among them. Assuming that the content and th...
Tien Tran, Richi Nayak, Peter Bruza