Sciweavers

JDM
2008

Reverse Engineering from an XML Document into an Extended DTD Graph

13 years 4 months ago
Reverse Engineering from an XML Document into an Extended DTD Graph
The extensible markup language (XML) has become a standard for persistent storage and data interchange via the Internet due to its openness, self-descriptiveness, and flexibility. This article proposes a systematic approach to reverse engineer arbitrary XML documents to their conceptual schema, extended DTD graphs, which are DTD graphs with data semantics. The proposed approach not only determines the structure of the XML document, but also derives candidate data semantics from the XML element instances by treating each XML element instance as a record in a table of a relational database. One application of the determined data semantics is to verify the linkages among elements. Implicit and explicit referential linkages are among XML elements modeled by the parent-children structure and ID/IDREF(S), respectively. As a result, an arbitrary XML document can be reverse engineered into its conceptual schema in an extended DTD graph format.
Herbert Shiu, Joseph Fong
Added 27 Dec 2010
Updated 27 Dec 2010
Type Journal
Year 2008
Where JDM
Authors Herbert Shiu, Joseph Fong
Comments (0)