DEXAW
2007
IEEE

Classifying XML Documents by Using Genre Features

Document categorization is traditionally topic-based. This paper presents a complementary analysis of research and experiments on genre, showing that encouraging results can be obtained by using features of genre structure (form). We conducted an experiment to assess the effectiveness of Extensible Markup Language (XML) tag information and part-of-speech (POS) features for genre classification. The hypothesis tested was that if a focus on genre can lead to high precision on ordinary textual documents, then good results can be achieved by using XML tag information in addition to POS information. An experiment was carried out on a subsection of the Initiative for the Evaluation of XML
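To make the idea of XML tag information as a classification feature concrete, the following is a minimal sketch that turns a document's element tags into relative frequencies. It is an illustration under assumptions, not the authors' feature set: the function name `tag_features` and the sample document are hypothetical, and the part-of-speech features are left out because they would require an external tagger.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def tag_features(xml_text):
    """Return the relative frequency of each element tag in an XML document.

    Hypothetical helper for illustration only; the paper's actual
    feature definitions are not given in the abstract.
    """
    root = ET.fromstring(xml_text)
    # root.iter() walks the root element and all of its descendants.
    counts = Counter(elem.tag for elem in root.iter())
    total = sum(counts.values())
    return {tag: n / total for tag, n in counts.items()}


# Made-up sample document with five elements in total.
sample = "<article><title>T</title><sec><p>a</p><p>b</p></sec></article>"
features = tag_features(sample)
```

A vector like `features` could be passed to any standard classifier alongside part-of-speech counts; which classifier and which tags matter for genre are questions the abstract does not answer.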
Malcolm Clark, Stuart N. K. Watt
Added: 02 Jun 2010
Updated: 02 Jun 2010
Type: Conference
Year: 2007
Where: DEXAW
Authors: Malcolm Clark, Stuart N. K. Watt