Adaptive Web Data Extraction Policies

13 years 10 months ago

Download cab.unime.it

Web data extraction is concerned, among other things, with routine data accessing and downloading from continuously-updated dynamic Web pages. There is a relevant trade-off between the rate at which the external Web sites are accessed and the computational burden on the accessing client. We address the problem by proposing a predictive model, typical of the Operating Systems literature, of the rate-of-update of each Web source. The presented model has been implemented into a new version of the Dynamo project: a middleware that assists in generating informative RSS feeds out of traditional HTML Web sites. To be effective, i.e., make RSS feeds be timely and informative and to be scalable, Dynamo needs a careful tuning and customization of its polling policies, which are described in detail.

Giacomo Fiumara, Massimo Marchi, Alessandro Provet

Real-time Traffic

POLICY 2007 | Routine Data Accessing | Rss Feeds | Web Sites |

claim paper

» Adapter Generation for Extracting and Querying Data from Web

» Composition of Qualitative Adaptation Policies

» Adaptive record extraction from web pages

» Modeling correlations in web traces and implications for designing replacement policies

» Adaptive Mobile Interfaces through Grammar Induction

» Accurately and Reliably Extracting Data from the Web A Machine Learning Approach

» Reliable and Adaptable Security Engineering for DatabaseWeb Services

» A privacy preserving web recommender system

Post Info
More Details (n/a)

Added	09 Jun 2010
Updated	09 Jun 2010
Type	Conference
Year	2007
Where	POLICY
Authors	Giacomo Fiumara, Massimo Marchi, Alessandro Provetti

Comments (0)

Sciweavers

Adaptive Web Data Extraction Policies

POLICY 2007 | Routine Data Accessing | Rss Feeds | Web Sites |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers