Search Sciweavers | Sciweavers

704 search results - page 44 / 141

» Semantic Structure Content for Dynamic Web Pages

125

Voted

WWW
2008
ACM

124views Internet Technology» more WWW 2008»

iRobot: an intelligent crawler for web forums

16 years 4 months ago

Download www2008.org

We study in this paper the Web forum crawling problem, which is a very fundamental step in many Web applications, such as search engine and Web data mining. As a typical user-crea...

Rui Cai, Jiang-Ming Yang, Wei Lai, Yida Wang, Lei ...

claim paper

Read More »

163

Voted

WWW
2010
ACM

188views Internet Technology» more WWW 2010»

Exploiting content redundancy for web information extraction

15 years 3 months ago

Download www.comp.nus.edu.sg

We propose a novel extraction approach that exploits content redundancy on the web to extract structured data from template-based web sites. We start by populating a seed database...

Pankaj Gulhane, Rajeev Rastogi, Srinivasan H. Seng...

claim paper

Read More »

143

click to vote

EDBTW
2010
Springer

139views Software Engineering» more EDBTW 2010»

Using visual pages analysis for optimizing web archiving

15 years 2 months ago

Download www-poleia.lip6.fr

Due to the growing importance of the World Wide Web, archiving it has become crucial for preserving useful source of information. To maintain a web archive up-to-date, crawlers ha...

Myriam Ben Saad, Stéphane Gançarski

claim paper

Read More »

153

Voted

DASFAA
2003
IEEE

139views Database» more DASFAA 2003»

Freshness-driven Adaptive Caching for Dynamic Content

15 years 9 months ago

Download www.public.asu.edu

With the wide availability of content delivery networks, many e-commerce Web applications utilize edge cache servers to cache and deliver dynamic contents at locations much closer...

Wen-Syan Li, Oliver Po, Wang-Pin Hsiung, K. Sel&cc...

claim paper

Read More »

135

click to vote

AIIA
2001
Springer

130views Artificial Intelligence» more AIIA 2001»

Evaluation Methods for Focused Crawling

15 years 8 months ago

Download www.dsi.unifi.it

The exponential growth of documents available in the World Wide Web makes it increasingly diﬃcult to discover relevant information on a speciﬁc topic. In this context, growing ...

Andrea Passerini, Paolo Frasconi, Giovanni Soda

claim paper

Read More »

« Prev « First page 44 / 141 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers