Search Sciweavers | Sciweavers

563 search results - page 13 / 113

» Crawling the web for structured documents

173

click to vote

WIKIS
2006
ACM

209views Internet Technology» more WIKIS 2006»

SweetWiki: semantic web enabled technologies in Wiki

15 years 11 months ago

Download www.wikisym.org

Wikis are social web sites enabling a potentially large number of participants to modify any page or create a new page using their web browser. As they grow, wikis may suffer from...

Michel Buffa, Fabien Gandon

claim paper

Read More »

141

Voted

ICWE
2005
Springer

77views Internet Technology» more ICWE 2005»

Identifying Websites with Flow Simulation

15 years 11 months ago

Download pierre.senellart.com

We present in this paper a method to discover the set of webpages contained in a logical website, based on the link structure of the Web graph. Such a method is useful in the conte...

Pierre Senellart

claim paper

Read More »

145

click to vote

WWW
2003
ACM

131views Internet Technology» more WWW 2003»

Dynamic maintenance of web indexes using landmarks

16 years 6 months ago

Download www.research.ibm.com

Recent work on incremental crawling has enabled the indexed document collection of a search engine to be more synchronized with the changing World Wide Web. However, this synchron...

Lipyeow Lim, Min Wang, Sriram Padmanabhan, Jeffrey...

claim paper

Read More »

172

click to vote

WWW
2002
ACM

130views Internet Technology» more WWW 2002»

Using web structure for classifying and describing web pages

16 years 6 months ago

Download dpennock.com

The structure of the web is increasingly being used to improve organization, search, and analysis of information on the web. For example, Google uses the text in citing documents ...

Eric J. Glover, Kostas Tsioutsiouliklis, Steve Law...

claim paper

Read More »

145

Voted

CLEF
2005
Springer

115views Information Technology» more CLEF 2005»

EuroGOV: Engineering a Multilingual Web Corpus

15 years 11 months ago

Download www.clef-campaign.org

EuroGOV is a multilingual web corpus that was created to serve as the document collection for WebCLEF, the CLEF 2005 web retrieval task. EuroGOV is a collection of web pages crawl...

Börkur Sigurbjörnsson, Jaap Kamps, Maart...

claim paper

Read More »

« Prev « First page 13 / 113 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers