IADIS
2003

Data Extraction from Web Database Query Result Pages via Tagsets and Integer Sequences

The World Wide Web is a collection of databases as well as web sites. Databases associated with web sites provide public access via query forms on web pages. They constitute an enormous repository of searchable data on an extremely diverse collection of subjects, ranging from multimedia collections through archives of subject-specific data to current information such as currency conversion rates, interest rates, news, and weather reports. Many interesting and valuable database applications could be developed if these databases were easily and reliably accessible to programs. The difficulty in extracting data lies in the number of different web page formats and the tendency of sites to change format suddenly. A rapid page analysis and wrapper creation system is needed to generate and maintain a data extraction facility for any required web site. This important goal has been the subject of substantial recent research, modelling the web results page in various ways. The purpose of the current paper is t...
Jerome Robinson
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2003
Where IADIS
Authors Jerome Robinson