Sciweavers

ER
2001
Springer

On the Automatic Extraction of Data from the Hidden Web

13 years 8 months ago
On the Automatic Extraction of Data from the Hidden Web
An increasing amount of Web data is accessible only by filling out HTML forms to query an underlying data source. While this is most welcome from a user perspective (queries are easy and precise) and from a data management perspective (static pages need not be maintained; databases can be accessed directly), automated agents have greater difficulty accessing data behind forms. In this paper we present a method for automatically filling in forms to retrieve the associated dynamically generated pages. Using our approach automated agents can begin to systematically access portions of the “hidden Web.”
Stephen W. Liddle, Sai Ho Yau, David W. Embley
Added 28 Jul 2010
Updated 28 Jul 2010
Type Conference
Year 2001
Where ER
Authors Stephen W. Liddle, Sai Ho Yau, David W. Embley
Comments (0)