Search Sciweavers | Sciweavers

113 search results - page 3 / 23

» Parallel and Distributed Document Overlap Detection on the W...

WWW
2007
ACM

Internet Technology

Detecting near-duplicates for web crawling

14 years 5 months ago

Download infolab.stanford.edu

Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...

Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma

CIKM
2008
Springer

Information Technology

A language for manipulating clustered web documents results

13 years 6 months ago

Download dblab.cs.nccu.edu.tw

We propose a novel conception language for exploring the results retrieved by several internet search services (like search engines) that cluster retrieved documents. The goal is ...

Gloria Bordogna, Alessandro Campi, Giuseppe Psaila...

CLUSTER
2001
IEEE

Distributed And Parallel Com...

Approximation Algorithms for Data Distribution with Load Balancing of Web Servers

13 years 8 months ago

Download www.seas.gwu.edu

Given the increasing traffic on the World Wide Web (Web), it is difficult for a single popular Web server to handle the demand from its many clients. By clustering a group of Web ...

Li-Chuan Chen, Hyeong-Ah Choi

ICDCS
2005
IEEE

Distributed And Parallel Com...

Using a Layered Markov Model for Distributed Web Ranking Computation

13 years 10 months ago

Download lsirpeople.epfl.ch

The link structure of the Web graph is used in algorithms such as Kleinberg’s HITS and Google’s PageRank to assign authoritative weights to Web pages and thus rank them. Both ...

Jie Wu, Karl Aberer

ICDCS
1998
IEEE

Distributed And Parallel Com...

A Framework for Consistent, Replicated Web Objects

13 years 8 months ago

Download www.cs.vu.nl

Despite the extensive use of caching techniques, the Web is overloaded. While the caching techniques currently used help some, it would be better to use different caching and repli...

Anne-Marie Kermarrec, Ihor Kuz, Maarten van Steen,...
