Search Sciweavers | Sciweavers

38 search results - page 2 / 8

» The indexable web is more than 11.5 billion pages

click to vote

VLDB
2000
ACM

104views Database» more VLDB 2000»

The Evolution of the Web and Implications for an Incremental Crawler

14 years 28 days ago

Download rose.cs.ucla.edu

In this paper we study how to build an effective incremental crawler. The crawler selectively and incrementally updates its index and/or local collection of web pages, instead of ...

Junghoo Cho, Hector Garcia-Molina

claim paper

Read More »

click to vote

ICTIR
2009
Springer

97views Information Technology» more ICTIR 2009»

PageRank: Splitting Homogeneous Singular Linear Systems of Index One

14 years 3 months ago

Download pubs.doc.ic.ac.uk

Abstract. The PageRank algorithm is used today within web information retrieval to provide a content-neutral ranking metric over web pages. It employs power method iterations to so...

Douglas V. de Jager, Jeremy T. Bradley

claim paper

Read More »

click to vote

SIGIR
2008
ACM

133views Information Technology» more SIGIR 2008»

Classifiers without borders: incorporating fielded text from neighboring web pages

13 years 9 months ago

Download www.cse.lehigh.edu

Accurate web page classification often depends crucially on information gained from neighboring pages in the local web graph. Prior work has exploited the class labels of nearby p...

Xiaoguang Qi, Brian D. Davison

claim paper

Read More »

click to vote

LREC
2010

149views Education» more LREC 2010»

DutchParl. The Parliamentary Documents in Dutch

13 years 10 months ago

Download ilps.science.uva.nl

A corpus called DutchParl is created which aims to contain all digitally available parliamentary documents written in the Dutch language. The first version of DutchParl contains d...

Maarten Marx, Anne Schuth

claim paper

Read More »

click to vote

WEBDB
2005
Springer

129views Database» more WEBDB 2005»

Searching for Hidden-Web Databases

14 years 2 months ago

Download www.cs.utah.edu

Recently, there has been increased interest in the retrieval and integration of hidden Web data with a view to leverage high-quality information available in online databases. Alt...

Luciano Barbosa, Juliana Freire

claim paper

Read More »

« Prev « First page 2 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers