Sciweavers

298 search results - page 20 / 60
» An information-theoretic measure for document similarity
Sort
View
133
Voted
ADC
2008
Springer
139views Database» more  ADC 2008»
15 years 8 months ago
Computing Structural Similarity of Source XML Schemas against Domain XML Schema
In this paper, we study the problem of measuring structural similarities of large number of source schemas against a single domain schema, which is useful for enhancing the qualit...
Jianxin Li, Chengfei Liu, Jeffrey Xu Yu, Jixue Liu...
133
Voted
SIGIR
2002
ACM
15 years 1 months ago
Novelty and redundancy detection in adaptive filtering
This paper addresses the problem of extending an adaptive information filtering system to make decisions about the novelty and redundancy of relevant documents. It argues that rel...
Yi Zhang 0001, James P. Callan, Thomas P. Minka
AIRWEB
2006
Springer
15 years 5 months ago
Tracking Web Spam with Hidden Style Similarity
Automatically generated content is ubiquitous in the web: dynamic sites built using the three-tier paradigm are good examples (e.g. commercial sites, blogs and other sites powered...
Tanguy Urvoy, Thomas Lavergne, Pascal Filoche
96
Voted
ICMCS
2006
IEEE
139views Multimedia» more  ICMCS 2006»
15 years 8 months ago
A Measure for Evaluating Retrieval Techniques based on Partially Ordered Ground Truth Lists
For the RISM A/II collection of musical incipits (short extracts of scores, taken from the beginning), we have established a ground truth based on the opinions of human experts. I...
Rainer Typke, Remco C. Veltkamp, Frans Wiering
106
Voted
GFKL
2005
Springer
142views Data Mining» more  GFKL 2005»
15 years 7 months ago
Near Similarity Search and Plagiarism Analysis
Abstract. Existing methods to text plagiarism analysis mainly base on “chunking”, a process of grouping a text into meaningful units each of which gets encoded by an integer nu...
Benno Stein, Sven Meyer zu Eissen