Sciweavers

KES
2010
Springer
13 years 7 months ago
DOCODE-Lite: A Meta-Search Engine for Document Similarity Retrieval
The retrieval of similar documents from large scale datasets has been the one of the main concerns in knowledge management environments, such as plagiarism detection, news impact a...
Felipe Bravo-Marquez, Gaston L'Huillier, Sebasti&a...
CGA
2006
13 years 9 months ago
The Distance-Similarity Metaphor in Region-Display Spatializations
n explore and understand abstract information spaces as if they were real geographic spaces. According to the distance-similarity metaphor1 one of the most popular spatial metaphor...
Sara Irina Fabrikant, Daniel R. Montello, David M....
SODA
2000
ACM
123views Algorithms» more  SODA 2000»
13 years 10 months ago
Communication complexity of document exchange
We address the problem of minimizing the communication involved in the exchange of similar documents. We consider two users, A and B, who hold documents x and y respectively. Neit...
Graham Cormode, Mike Paterson, Süleyman Cenk ...
ECIR
2010
Springer
13 years 11 months ago
Laplacian Co-hashing of Terms and Documents
A promising way to accelerate similarity search is semantic hashing which designs compact binary codes for a large number of documents so that semantically similar documents are ma...
Dell Zhang, Jun Wang, Deng Cai, Jinsong Lu
ICCS
2009
Springer
14 years 4 months ago
Frequent Itemset Mining for Clustering Near Duplicate Web Documents
A vast amount of documents in the Web have duplicates, which is a challenge for developing efficient methods that would compute clusters of similar documents. In this paper we use ...
Dmitry I. Ignatov, Sergei O. Kuznetsov