Sciweavers

DGO
2003

Extending Metadata Definitions by Automatically Extracting and Organizing Glossary Definitions

13 years 6 months ago
Extending Metadata Definitions by Automatically Extracting and Organizing Glossary Definitions
Metadata descriptions of database contents are required to build and use systems that access and deliver data in response to user requests. When numerous heterogeneous databases are brought together in a single system, their various metadata formalizations must be homogenized and integrated in order to support the access planning and delivery system. This integration is a tedious process that requires human expertise and attention. In this paper we describe a method of speeding up the formalization and integration of new metadata. The method takes advantage of the fact that databases are often described in web pages containing natural language glossaries that define pertinent aspects of the data. Given a root URL, our method identifies likely glossaries, extracts and formalizes aspects of relevant concepts defined in them, and automatically integrates the new formalized metadata concepts into a large model of the domain and associated conceptualizations.
Eduard H. Hovy, Andrew Philpot, Judith Klavans, Ul
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2003
Where DGO
Authors Eduard H. Hovy, Andrew Philpot, Judith Klavans, Ulrich Germann, Peter T. Davis
Comments (0)