Sciweavers

298 search results - page 39 / 60
» An information-theoretic measure for document similarity
Sort
View
SPIRE
2004
Springer
15 years 5 months ago
Dealing with Syntactic Variation Through a Locality-Based Approach
To date, attempts for applying syntactic information in the document-based retrieval model dominant have led to little practical improvement, mainly due to the problems associated ...
Jesús Vilares Ferro, Miguel A. Alonso
CICLING
2010
Springer
15 years 3 months ago
Word Length n-Grams for Text Re-use Detection
Abstract. The automatic detection of shared content in written documents –which includes text reuse and its unacknowledged commitment, plagiarism– has become an important probl...
Alberto Barrón-Cedeño, Chiara Basile...
ICPR
2006
IEEE
16 years 27 days ago
A General Framework for Agglomerative Hierarchical Clustering Algorithms
This paper presents a general framework for agglomerative hierarchical clustering based on graphs. Specifying an inter-cluster similarity measure, a subgraph of the similarity gra...
Reynaldo Gil-García, José Manuel Bad...
CAE
2007
15 years 2 months ago
Extracting the Essence from Sets of Images
We use a set of photographs showing similar scenes as a model for a single photograph this scene. A distance measure for this model is defined by correlating the neigborhoods of p...
Marc Alexa
SIGIR
2010
ACM
14 years 6 months ago
Three web-based heuristics to determine a person's or institution's country of origin
We propose three heuristics to determine the country of origin of a person or institution via text-based IE from the Web. We evaluate all methods on a collection of music artists ...
Markus Schedl, Klaus Seyerlehner, Dominik Schnitze...