Sciweavers

308 search results - page 5 / 62
» Syntactic Similarity of Web Documents
SPIRE
2004
Springer
15 years 5 months ago
Dealing with Syntactic Variation Through a Locality-Based Approach
To date, attempts to apply syntactic information in the dominant document-based retrieval model have led to little practical improvement, mainly due to the problems associated ...
Jesús Vilares Ferro, Miguel A. Alonso
CPM
2000
Springer
15 years 4 months ago
Identifying and Filtering Near-Duplicate Documents
Abstract. The mathematical concept of document resemblance captures well the informal notion of syntactic similarity. The resemblance can be estimated using a fixed size “sketch...
Andrei Z. Broder
IPM
2008
14 years 11 months ago
Towards a unified approach to document similarity search using manifold-ranking of blocks
Document similarity search (i.e. query by example) aims to retrieve a ranked list of documents similar to a query document in a text corpus or on the Web. Most existing approaches...
Xiaojun Wan, Jianwu Yang, Jianguo Xiao
KES
2010
Springer
14 years 10 months ago
DOCODE-Lite: A Meta-Search Engine for Document Similarity Retrieval
The retrieval of similar documents from large-scale datasets has been one of the main concerns in knowledge management environments, such as plagiarism detection, news impact a...
Felipe Bravo-Marquez, Gaston L'Huillier, Sebasti...
IJCNLP
2005
Springer
15 years 5 months ago
A Comparative Study of Language Models for Book and Author Recognition
Abstract. Linguistic information can help improve evaluation of similarity between documents; however, the kind of linguistic information to be used depends on the task. In this pa...
Özlem Uzuner, Boris Katz