Sciweavers

90 search results - page 9 / 18
» Large-Scale Duplicate Detection for Web Image Search
Sort
View
SAMT
2007
Springer
171views Multimedia» more  SAMT 2007»
15 years 3 months ago
SAPIR: Scalable and Distributed Image Searching
—In this paper we present a scalable and distributed system for image retrieval based on visual features and annotated text. This system is the core of the SAPIR project. Its arc...
Fabrizio Falchi, Mouna Kacimi, Yosi Mass, Fausto R...
PAKDD
2009
ACM
120views Data Mining» more  PAKDD 2009»
15 years 6 months ago
Detecting Link Hijacking by Web Spammers.
Abstract. Since current search engines employ link-based ranking algorithms as an important tool to decide a ranking of sites, Web spammers are making a significant effort to man...
Masaru Kitsuregawa, Masashi Toyoda, Young-joo Chun...
84
Voted
WSDM
2010
ACM
204views Data Mining» more  WSDM 2010»
15 years 4 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...
MM
2009
ACM
156views Multimedia» more  MM 2009»
15 years 4 months ago
Understanding near-duplicate videos: a user-centric approach
Popular content in video sharing web sites (e.g., YouTube) is usually duplicated. Most scholars define near-duplicate video clips (NDVC) based on non-semantic features (e.g., di...
Mauro Cherubini, Rodrigo de Oliveira, Nuria Oliver
MCAM
2007
Springer
188views Multimedia» more  MCAM 2007»
15 years 3 months ago
Searching One Billion Web Images by Content: Challenges and Opportunities
Although content-based image retrieval has been studied for decades, most commercial image search engines are still text-based. However, there is a growing demand for techniques to...
Zhiwei Li, Xing Xie, Lei Zhang, Wei-Ying Ma