Sciweavers

298 search results - page 16 / 60
» An information-theoretic measure for document similarity
WWW
2008
ACM
16 years 15 days ago
Web graph similarity for anomaly detection (poster)
Web graphs are approximate snapshots of the web, created by search engines. Their creation is an error-prone procedure that relies on the availability of Internet nodes and the fa...
Panagiotis Papadimitriou, Ali Dasdan, Hector ...
ACL
2006
15 years 1 month ago
Names and Similarities on the Web: Fact Extraction in the Fast Lane
In a new approach to large-scale extraction of facts from unstructured text, distributional similarities become an integral part of both the iterative acquisition of high-coverage...
Marius Pasca, Dekang Lin, Jeffrey Bigham, Andrei L...
CIKM
2004
Springer
15 years 5 months ago
Swoogle: a search and metadata engine for the semantic web
Swoogle is a crawler-based indexing and retrieval system for the Semantic Web documents – i.e., RDF or OWL documents. It analyzes the documents it discovered to compute useful m...
Li Ding, Timothy W. Finin, Anupam Joshi, Rong Pan,...
PODS
2008
ACM
15 years 12 months ago
The power of two min-hashes for similarity search among hierarchical data objects
In this study we propose sketching algorithms for computing similarities between hierarchical data. Specifically, we look at data objects that are represented using leaf-labeled t...
Sreenivas Gollapudi, Rina Panigrahy
IDEAS
2009
IEEE
15 years 6 months ago
A cluster-based approach to XML similarity joins
A natural consequence of the widespread adoption of XML as standard for information representation and exchange is the redundant storage of large amounts of persistent XML documen...
Leonardo Ribeiro, Theo Härder, Fernanda S. Pi...