Sciweavers

86 search results - page 5 / 18
» Measuring similarity of semi-structured documents with conte...
Sort
View
SIGIR
2010
ACM
15 years 1 months ago
Adaptive near-duplicate detection via similarity learning
In this paper, we present a novel near-duplicate document detection method that can easily be tuned for a particular domain. Our method represents each document as a real-valued s...
Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz
BMCBI
2010
201views more  BMCBI 2010»
14 years 7 months ago
IntelliGO: a new vector-based semantic similarity measure including annotation origin
Background: The Gene Ontology (GO) is a well known controlled vocabulary describing the biological process, molecular function and cellular component aspects of gene annotation. I...
Sidahmed Benabderrahmane, Malika Smaïl-Tabbon...
ITCC
2003
IEEE
15 years 2 months ago
A Method for Calculating Term Similarity on Large Document Collections
We present an efficient algorithm called the Quadtree Heuristic for identifying a list of similar terms for each unique term in a large document collection. Term similarity is de...
Wolfgang W. Bein, Jeffrey S. Coombs, Kazem Taghva
DIAL
2006
IEEE
146views Image Analysis» more  DIAL 2006»
14 years 11 months ago
Distance Measures for Layout-Based Document Image Retrieval
Most methods for document image retrieval rely solely on text information to find similar documents. This paper describes a way to use layout information for document image retrie...
Joost van Beusekom, Daniel Keysers, Faisal Shafait...
AIED
2009
Springer
15 years 4 months ago
Assessing Student Paraphrases Using Lexical Semantics and Word Weighting
We present in this paper an approach to assessing student paraphrases in the intelligent tutoring system iSTART. The approach is based on measuring the semantic similarity between ...
Vasile Rus, Mihai C. Lintean, Arthur C. Graesser, ...