Sciweavers

853 search results - page 41 / 171
» Similarity Indexing with the SS-tree
Sort
View
104
Voted
LREC
2008
125views Education» more  LREC 2008»
14 years 11 months ago
Similar Term Discovery using Web Search
We present an approach to the discovery of semantically similar terms that utilizes a web search engine as both a source for generating related terms and a tool for estimating the...
Peter G. Anick, Vijay Murthi, Shaji Sebastian
94
Voted
PVLDB
2010
252views more  PVLDB 2010»
14 years 4 months ago
Efficient and Effective Similarity Search over Probabilistic Data based on Earth Mover's Distance
Probabilistic data is coming as a new deluge along with the technical advances on geographical tracking, multimedia processing, sensor network and RFID. While similarity search is...
Jia Xu, Zhenjie Zhang, Anthony K. H. Tung, Ge Yu
CIVR
2007
Springer
128views Image Analysis» more  CIVR 2007»
15 years 3 months ago
An empirical study of inter-concept similarities in multimedia ontologies
Generic concept detection has been a widely studied topic in recent research on multimedia analysis and retrieval, but the issue of how to exploit the structure of a multimedia on...
Markus Koskela, Alan F. Smeaton
WWW
2006
ACM
15 years 3 months ago
Do not crawl in the DUST: different URLs with similar text
We consider the problem of dust: Different URLs with Similar Text. Such duplicate URLs are prevalent in web sites, as web server software often uses aliases and redirections, and...
Uri Schonfeld, Ziv Bar-Yossef, Idit Keidar
86
Voted
PVLDB
2010
126views more  PVLDB 2010»
14 years 8 months ago
Set Similarity Join on Probabilistic Data
Set similarity join has played an important role in many real-world applications such as data cleaning, near duplication detection, data integration, and so on. In these applicati...
Xiang Lian, Lei Chen 0002