Sciweavers

DOCENG
2003
ACM

Methods for the semantic analysis of document markup

13 years 9 months ago
Methods for the semantic analysis of document markup
We present an approach on how to investigate what kind of semantic information is regularly associated with the structural markup of scientific articles. This approach addresses the need for an explicit formal description of the semantics of text-oriented XML-documents. The domain of our investigation is a corpus of scientific articles from psychology and linguistics from both English and German online available journals. For our analyses, we provide XML-markup representing two kinds of semantic levels: the thematic level (i.e. topics in the text world that the article is about) and the functional or rhetorical level. Our hypothesis is that these semantic levels correlate with the articles’ document structure also represented in XML. Articles have been annotated with the appropriate information. Each of the three informational levels is modelled in a separate XML document, since in our domain, the different description levels might conflict so that it is impossible to model them...
Petra Saskia Bayerl, Harald Lüngen, Daniela G
Added 05 Jul 2010
Updated 05 Jul 2010
Type Conference
Year 2003
Where DOCENG
Authors Petra Saskia Bayerl, Harald Lüngen, Daniela Goecke, Andreas Witt, Daniel Naber
Comments (0)