Sciweavers

218 search results - page 18 / 44
» Crawling for Images on the WWW
Sort
View
WWW
2010
ACM
15 years 4 months ago
A pattern tree-based approach to learning URL normalization rules
Duplicate URLs have brought serious troubles to the whole pipeline of a search engine, from crawling, indexing, to result serving. URL normalization is to transform duplicate URLs...
Tao Lei, Rui Cai, Jiang-Ming Yang, Yan Ke, Xiaodon...
WWW
2011
ACM
14 years 4 months ago
we.b: the web of short urls
Short URLs have become ubiquitous. Especially popular within social networking services, short URLs have seen a significant increase in their usage over the past years, mostly du...
Demetres Antoniades, Iasonas Polakis, Georgios Kon...
WWW
2011
ACM
14 years 4 months ago
Design and implementation of contextual information portals
This paper presents a system for enabling offline web use to satisfy the information needs of disconnected communities. We describe the design, implementation, evaluation, and pil...
Jay Chen, Russell Power, Lakshminarayanan Subraman...
ICIP
2008
IEEE
15 years 11 months ago
Long term learning for image retrieval over networks
In this paper, we present a long term learning system for content based image retrieval over a network. Relevant feedback is used among different sessions to learn both the simila...
David Picard, Arnaud Revel, Matthieu Cord
85
Voted
WWW
2006
ACM
15 years 10 months ago
Finding visual concepts by web image mining
We propose measuring "visualness" of concepts with images on the Web, that is, what extent concepts have visual characteristics. This is a new application of "Web i...
Keiji Yanai, Kobus Barnard