Sciweavers

13 search results - page 1 / 3
» Clustering-based incremental web crawling
Sort
View
TOIS
2010
99views more  TOIS 2010»
13 years 23 hour ago
Clustering-based incremental web crawling
Qingzhao Tan, Prasenjit Mitra
WWW
2008
ACM
14 years 6 months ago
Incremental web page template detection
Most template detection methods process web pages in batches that a newly crawled page can not be processed until enough pages have been collected. This results in large storage c...
Yu Wang, Binxing Fang, Xueqi Cheng, Li Guo, Hongbo...
ICML
2007
IEEE
14 years 6 months ago
Focused crawling with scalable ordinal regression solvers
In this paper we propose a novel, scalable, clustering based Ordinal Regression formulation, which is an instance of a Second Order Cone Program (SOCP) with one Second Order Cone ...
Rashmin Babaria, J. Saketha Nath, S. Krishnan, K. ...
ECIR
2006
Springer
13 years 6 months ago
Efficient Parallel Computation of PageRank
Abstract. PageRank inherently is massively parallelizable and distributable, as a result of web's strict host-based link locality. In this paper we show that the Gau
Christian Kohlschütter, Paul-Alexandru Chirit...
WWW
2007
ACM
14 years 6 months ago
Efficient Update of Indexes for Dynamically Changing Web Documents
Recent work on incremental crawling has enabled the indexed document collection of a search engine to be more synchronized with the changing World Wide Web. However, this synchron...
Lipyeow Lim, Min Wang, Sriram Padmanabhan, Jeffrey...