Sciweavers

RIAO
1997

Towards Sophisticated Wrapping of Web-based information Repositories

13 years 5 months ago
Towards Sophisticated Wrapping of Web-based information Repositories
Access to on-line information via the Web is exploding. Index and retrieval engines already start to integrate a huge variety of heterogeneous repositories. However, the heterogeneity issue remains, both in terms of the search formats and the formats of the result pages. In this paper we focus on html-based search and result presentations. We discuss our experience in the design, the development and the maintenance of wrappers (in the context of the Knowledge Broker project). We outline different ways to write wrappers, illustrate some of the lessons learned, and conclude by describing a semi-automatic approach for an efficient wrapping of Web-based information repositories. Throughout the paper, we give illustrating examples for hands-on readers. KeyWords World Wide Web; heterogeneous repositories; wrapping; information extraction; rule-based parsing.
Boris Chidlovskii, Uwe M. Borghoff, Pierre-Yves Ch
Added 01 Nov 2010
Updated 01 Nov 2010
Type Conference
Year 1997
Where RIAO
Authors Boris Chidlovskii, Uwe M. Borghoff, Pierre-Yves Chevalier
Comments (0)