Sciweavers

AND
2009

A comprehensive evaluation methodology for noisy historical document recognition techniques

In this paper, we propose a new comprehensive methodology for evaluating the performance of noisy historical document recognition techniques. We aim to evaluate not only the final recognition result but also the main intermediate stages of text line, word and character segmentation. For this purpose, we efficiently create the text line, word and character segmentation ground truth, guided by the transcription of the historical documents. The proposed methodology consists of (i) a semi-automatic procedure that detects the text line, word and character segmentation ground truth regions using the correct document transcription, and (ii) the calculation of appropriate evaluation metrics that measure the performance of the final OCR result as well as of the intermediate segmentation stages. The semi-automatic procedure for detecting the ground truth regions has been evaluated and shown to be efficient and time-saving. Experimental results prove that using the proposed techniqu...
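The abstract does not say which metrics the authors compute, so the following is only an illustrative sketch of two measures commonly used for this kind of evaluation: an edit-distance-based character accuracy for the final OCR result, and an intersection-over-union score for comparing a detected segmentation region with a ground truth region. The function names `edit_distance`, `character_accuracy` and `box_iou`, and the `(x0, y0, x1, y1)` box convention, are assumptions made for this example and are not taken from the paper.

```python
def edit_distance(truth: str, ocr: str) -> int:
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    prev = list(range(len(ocr) + 1))
    for i, ct in enumerate(truth, start=1):
        curr = [i]
        for j, co in enumerate(ocr, start=1):
            cost = 0 if ct == co else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def character_accuracy(truth: str, ocr: str) -> float:
    """Assumed metric: 1 - (edit distance / transcription length), floored at 0."""
    if not truth:
        return 1.0 if not ocr else 0.0
    return max(0.0, 1.0 - edit_distance(truth, ocr) / len(truth))


def box_iou(a, b) -> float:
    """Intersection over union of two axis-aligned boxes (x0, y0, x1, y1)."""
    iw = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1])
             - inter)
    return inter / union if union else 0.0
```

In such a setup, a detected text line, word or character region could be counted as correct when its overlap with a ground truth region exceeds a chosen threshold; the threshold, like the metrics themselves, would have to come from the paper rather than from this sketch.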
Added 16 Feb 2011
Updated 16 Feb 2011
Type Journal
Year 2009
Where AND
Authors Nikolaos Stamatopoulos, Georgios Louloudis, Basilios Gatos