Search Sciweavers | Sciweavers

178 search results - page 3 / 36

» Scheduling Algorithms for Web Crawling

click to vote

WWW
2008
ACM

109views Internet Technology» more WWW 2008»

Recrawl scheduling based on information longevity

14 years 6 months ago

Download www2008.org

It is crucial for a web crawler to distinguish between ephemeral and persistent content. Ephemeral content (e.g., quote of the day) is usually not worth crawling, because by the t...

Christopher Olston, Sandeep Pandey

claim paper

Read More »

click to vote

WWW
2003
ACM

219views Internet Technology» more WWW 2003»

Adaptive on-line page importance computation

14 years 6 months ago

Download mainline.brynmawr.edu

The computation of page importance in a huge dynamic graph has recently attracted a lot of attention because of the web. Page importance, or page rank is defined as the fixpoint o...

Serge Abiteboul, Mihai Preda, Gregory Cobena

claim paper

Read More »

click to vote

DMIN
2007

183views Data Mining» more DMIN 2007»

Crawling Attacks Against Web-based Recommender Systems

13 years 7 months ago

Download maya.cs.depaul.edu

—User proﬁles derived from Web navigation data are used in important e-commerce applications such as Web personalization, recommender systems, and Web analytics. In the open en...

Runa Bhaumik, Robin D. Burke, Bamshad Mobasher

claim paper

Read More »

click to vote

WWW
2009
ACM

153views Internet Technology» more WWW 2009»

Sitemaps: above and beyond the crawl of duty

14 years 6 months ago

Download www2009.eprints.org

Comprehensive coverage of the public web is crucial to web search engines. Search engines use crawlers to retrieve pages and then discover new ones by extracting the pages' o...

Uri Schonfeld, Narayanan Shivakumar

claim paper

Read More »

click to vote

ICDE
2007
IEEE

167views Database» more ICDE 2007»

DSphere: A Source-Centric Approach to Crawling, Indexing and Searching the World Wide Web

14 years 7 months ago

Download www.cc.gatech.edu

We describe DSPHERE1 - a decentralized system for crawling, indexing, searching and ranking of documents in the World Wide Web. Unlike most of the existing search technologies tha...

Bhuvan Bamba, Ling Liu, James Caverlee, Vaibhav Pa...

claim paper

Read More »

« Prev « First page 3 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers