Sciweavers

298 search results - page 3 / 60
» An information-theoretic measure for document similarity
Sort
View
SIGIR
2006
ACM
14 years 7 days ago
Measuring similarity of semi-structured documents with context weights
In this work, we study similarity measures for text-centric XML documents based on an extended vector space model, which considers both document content and structure. Experimenta...
Christopher C. Yang, Nan Liu
SIGIR
2003
ACM
13 years 11 months ago
An information-theoretic measure for document similarity
Recent work has demonstrated that the assessment of pairwise object similarity can be approached in an axiomatic manner using information theory. We extend this concept specifica...
Javed A. Aslam, Meredith Frost
AMR
2005
Springer
128views Multimedia» more  AMR 2005»
13 years 11 months ago
Ranking Invariance Based on Similarity Measures in Document Retrieval
Abstract. To automatically retrieve documents or images from a database, retrieval systems use similarity measures to compare a request based on features extracted from the documen...
Jean-François Omhover, Maria Rifqi, Marcin ...
NAACL
2007
13 years 7 months ago
Document Similarity Measures to Distinguish Native vs. Non-Native Essay Writers
The ability to distinguish statistically different populations of speakers or writers can be an important asset in many NLP applications. In this paper, we describe a method of us...
Olga Gurevich, Paul Deane
DASFAA
2007
IEEE
240views Database» more  DASFAA 2007»
14 years 18 days ago
A Comparative Study of Ontology Based Term Similarity Measures on PubMed Document Clustering
Recent research shows that ontology as background knowledge can improve document clustering quality with its concept hierarchy knowledge. Previous studies take term semantic simila...
Xiaodan Zhang, Liping Jing, Xiaohua Hu, Michael K....