Sciweavers

15 search results - page 1 / 3
» Collaborative Web Crawling: Information Gathering Processing...
Sort
View
HICSS
1999
IEEE
178views Biometrics» more  HICSS 1999»
13 years 8 months ago
Collaborative Web Crawling: Information Gathering/Processing over Internet
The main objective of the IBM Grand Central Station (GCS) is to gather information of virtually any type of formats (text, data, image, graphics, audio, video) from the cyberspace...
Shang-Hua Teng, Qi Lu, Matthias Eichstaedt, Daniel...
WWW
2006
ACM
14 years 5 months ago
Geographically focused collaborative crawling
A collaborative crawler is a group of crawling nodes, in which each crawling node is responsible for a specific portion of the web. We study the problem of collecting geographical...
Weizheng Gao, Hyun Chul Lee, Yingbo Miao
ITSSA
2006
581views more  ITSSA 2006»
13 years 4 months ago
Agent-Based Approach for Web Crawling
: Since its creation in 1990, World Wide Web has increased the popularity of Internet which becomes an important source of information or services for all people over the world. Th...
Maxime Wack, Mohamed Bakhouya, Jaafar Gaber
WWW
2007
ACM
14 years 5 months ago
Detecting near-duplicates for web crawling
Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...
Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma
LAWEB
2003
IEEE
13 years 9 months ago
Cooperative Crawling
Web crawler design presents many different challenges: architecture, strategies, performance and more. One of the most important research topics concerns improving the selection o...
Marina Buzzi