
AIIA
2003
Springer

Incremental Induction of Rules for Document Image Understanding

This paper applies first-order logic machine learning techniques to two document domains in order to learn rules for recognizing the semantic role of their logical components. Specifically, the multistrategy incremental learning system INTHELEX is applied to multi-format scientific papers and to documents concerning European films from the 1920s and 1930s. The challenge lies in the different levels of formatting standards in these domains: from (more or less) standardized layouts in scientific papers to almost no standard at all in historical cultural heritage material. Experimental results in both domains, together with a comparison with the Progol system, assess the advantages that INTHELEX can yield.
Added 06 Jul 2010
Updated 06 Jul 2010
Type Conference
Year 2003
Where AIIA
Authors Stefano Ferilli, Nicola Di Mauro, Teresa Maria Altomare Basile, Floriana Esposito