Sciweavers

ICDAR
2011
IEEE

Automatic Estimation of the Legibility of Binarised Historic Documents for Unsupervised Parameter Tuning

12 years 4 months ago
Automatic Estimation of the Legibility of Binarised Historic Documents for Unsupervised Parameter Tuning
—Document enhancement tools are a valuable help in the study of historic documents. Given proper filter settings, many effects that impair the legibility can be evened out (e.g. washed out ink, stained and yellowed paper). However, because of differing authors, languages, handwritings, fonts and paper conditions, no single filter parameter set fits all documents. Therefore, the parameters are usually tuned in a time-consuming manual process to every individual document. To simplify this procedure, this paper introduces a classifier for the legibility of an enhanced historic text document. Experiments on the binarisation of a set of documents from 1938 to 1946 show that the classifier can be used to automatically derive robust filter settings for a variety of documents. Keywords-Document enhancement, historic documents, legibility estimation
Martin Stommel, G. Frieder
Added 24 Dec 2011
Updated 24 Dec 2011
Type Journal
Year 2011
Where ICDAR
Authors Martin Stommel, G. Frieder
Comments (0)