pages | Sciweavers

8

LAWEB
2003
IEEE

96views Internet Technology» more LAWEB 2003»

On the Evolution of Clusters of Near-Duplicate Web Pages

13 years 9 months ago

This paper expands on a 1997 study of the amount and distribution of near-duplicate pages on the World Wide Web. We downloaded a set of 150 million web pages on a weekly basis ove...

Dennis Fetterly, Mark Manasse, Marc Najork

claim paper

Read More »

11

click to vote

WIDM
2005
ACM

125views Internet Technology» more WIDM 2005»

DirectoryRank: ordering pages in web directories

13 years 10 months ago

Download nike.psu.edu

Web Directories are repositories of Web pages organized in a hierarchy of topics and sub-topics. In this paper, we present DirectoryRank, a ranking framework that orders the pages...

Vlassis Krikos, Sofia Stamou, Pavlos Kokosis, Alex...

claim paper

Read More »

9

click to vote

SIGMOD
2005
ACM

126views Database» more SIGMOD 2005»

Page Quality: In Search of an Unbiased Web Ranking

13 years 10 months ago

Download oak.cs.ucla.edu

In a number of recent studies [4, 8] researchers have found that because search engines repeatedly return currently popular pages at the top of search results, popular pages tend ...

Junghoo Cho, Sourashis Roy, Robert Adams

claim paper

Read More »

18

click to vote

WISE
2005
Springer

151views Internet Technology» more WISE 2005»

Extracting Web Data Using Instance-Based Learning

13 years 10 months ago

Download www.cs.uic.edu

This paper studies structured data extraction from Web pages, e.g., online product description pages. Existing approaches to data extraction include wrapper induction and automatic...

Yanhong Zhai, Bing Liu

claim paper

Read More »

22

click to vote

WISE
2005
Springer

204views Internet Technology» more WISE 2005»

Temporal Ranking of Search Engine Results

13 years 10 months ago

Download www.dl.kuis.kyoto-u.ac.jp

Existing search engines contain the picture of the Web from the past and their ranking algorithms are based on data crawled some time ago. However, a user requires not only relevan...

Adam Jatowt, Yukiko Kawai, Katsumi Tanaka

claim paper

Read More »

30

click to vote

FIRBPERF
2005
IEEE

260views Algorithms» more FIRBPERF 2005»

Models of Dynamic Web Content

13 years 10 months ago

Download mafalda.unipv.it

Web pages are created, modiﬁed and removed at unspeciﬁed times by their owners. The frequency and extent of changes to Web pages vary across sites and across pages within site...

Mariacarla Calzarossa, Daniele Tessera

claim paper

Read More »

11

click to vote

LAWEB
2007
IEEE

91views Internet Technology» more LAWEB 2007»

Distinctive Features of the Argentinian Web

13 years 10 months ago

Download www.chato.cl

This article presents the most distinguishing features of the Argentinian web as found in a private sample of almost 10 million web pages from 150.000 sites collected in the early...

Gabriel Tolosa, Fernando Bordignon, Ricardo A. Bae...

claim paper

Read More »

12

click to vote

ICDE
2007
IEEE

142views Database» more ICDE 2007»

An Automatic Page Link Generation Method based on Users' Behavior

13 years 10 months ago

Download ccs.njit.edu

In this paper, we propose a novel method for generating personalized page links. The page links which are generated by our proposed method are useful if users look for web pages r...

Yu Suzuki, Keigo Nakatani, Kyoji Kawagoe

claim paper

Read More »

14

click to vote

ESA
2009
Springer

99views Algorithms» more ESA 2009»

Minimizing Maximum Response Time and Delay Factor in Broadcast Scheduling

13 years 11 months ago

Download www.cs.illinois.edu

We consider online algorithms for pull-based broadcast scheduling. In this setting there are n pages of information at a server and requests for pages arrive online. When the serv...

Chandra Chekuri, Sungjin Im, Benjamin Moseley

claim paper

Read More »

14

click to vote

WWW
2010
ACM

272views Internet Technology» more WWW 2010»

Diversifying web search results

13 years 11 months ago

Download webdocs.cs.ualberta.ca

Result diversity is a topic of great importance as more facets of queries are discovered and users expect to ﬁnd their desired facets in the ﬁrst page of the results. However,...

Davood Rafiei, Krishna Bharat, Anand Shukla

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers