WWW
15 years 9 months ago
2006 ACM
The web crawler space is often delimited into two general areas: full-web crawling and focused crawling. We present netSifter, a crawler system which integrates features from thes...
94
Voted
WWW
15 years 9 months ago
2006 ACM
Understanding goals and preferences behind a user's online activities can greatly help information providers, such as search engine and E-Commerce web sites, to personalize c...
WWW
15 years 9 months ago
2006 ACM
Ranking methods like PageRank assess the importance of Web pages based on the current state of the rapidly evolving Web graph. The dynamics of the resulting importance scores, how...
74
Voted
WWW
15 years 9 months ago
2006 ACM
Since the publication of Brin and Page's paper on PageRank, many in the Web community have depended on PageRank for the static (query-independent) ordering of Web pages. We s...
88
Voted
WWW
15 years 9 months ago
2006 ACM
XML is fast becoming the standard format to store, exchange and publish over the web, and is getting embedded in applications. Two challenges in handling XML are its size (the XML...
|