Sciweavers

JIIS
2000

Machine Learning for Intelligent Processing of Printed Documents

13 years 4 months ago
Machine Learning for Intelligent Processing of Printed Documents
Abstract. A paper document processing system is an information system component which transforms information on printed or handwritten documents into a computer-revisable form. In intelligent systems for paper document processing this information capture process is based on knowledge of the specific layout and logical structures of the documents. This article proposes the application of machine learning techniques to acquire the specific knowledge required by an intelligent document processing system, named WISDOM++, that manages printed documents, such as letters and journals. Knowledge is represented by means of decision trees and firstorder rules automatically generated from a set of training documents. In particular, an incremental decision tree learning system is applied for the acquisition of decision trees used for the classification of segmented blocks, while a first-order learning system is applied for the induction of rules used for the layout-based classification and underst...
Floriana Esposito, Donato Malerba, Francesca A. Li
Added 19 Dec 2010
Updated 19 Dec 2010
Type Journal
Year 2000
Where JIIS
Authors Floriana Esposito, Donato Malerba, Francesca A. Lisi
Comments (0)