LPNMR
2001
Springer

Declarative Information Extraction, Web Crawling, and Recursive Wrapping with Lixto

Lixto is a system and method for the visual and interactive generation of wrappers for Web pages under the supervision of a human developer, for automatically extracting information from Web pages using such wrappers, and for translating the extracted content into XML. This paper describes some advanced features of Lixto, such as disjunctive pattern definitions, specialization rules, and Lixto’s capability of collecting and aggregating information from several linked Web pages.
Robert Baumgartner, Sergio Flesca, Georg Gottlob
Added 30 Jul 2010
Updated 30 Jul 2010
Type Conference
Year 2001
Where LPNMR