Sciweavers

218 search results - page 9 / 44
» Crawling for Images on the WWW
Sort
View
61
Voted
WWW
2008
ACM
15 years 10 months ago
Incremental web page template detection
Most template detection methods process web pages in batches that a newly crawled page can not be processed until enough pages have been collected. This results in large storage c...
Yu Wang, Binxing Fang, Xueqi Cheng, Li Guo, Hongbo...
WWW
2003
ACM
15 years 10 months ago
Automatic Profile Generation in eRACE
In this paper, we describe the design of a profile generator toolkit, which aims to automatically create realistic user profiles for a mobile personalized portal service. These pr...
Christiana Christophi, Marios D. Dikaiakos
53
Voted
WWW
2002
ACM
15 years 10 months ago
Parallel crawlers
In this paper we study how we can design an effective parallel crawler. As the size of the Web grows, it becomes imperative to parallelize a crawling process, in order to finish d...
Junghoo Cho, Hector Garcia-Molina
MVA
1998
134views Computer Vision» more  MVA 1998»
14 years 11 months ago
Orientation and Scale Invariant Text Region Extraction in WWW Images
Text extraction from a web image is important for web indexing because the text can contain a key information of the web. This paper presents a method to detect a text with variou...
Taehoon Park, Dongsung Kim, Kyusik Chung
APWEB
2005
Springer
15 years 3 months ago
Indexing Text and Visual Features for WWW Images
In this paper, we present a novel indexing technique called Multi-scale Similarity Indexing (MSI) to index image’s multi-features into a single one-dimensional structure. Both f...
Heng Tao Shen, Xiaofang Zhou, Bin Cui