An Approach to Extracting the Target Text Line from a Document Image Captured by a Pen Scanner

15 years 10 months ago

Download www.cse.salford.ac.uk

In this paper, we present a new approach to extracting the target text line from a document image captured by a pen scanner. Given the binary image, a set of possible text lines are ﬁrst formed by nearest-neighbor grouping of connected components (CC). They are then reﬁned by text line merging and adding the missed CCs. The possible target text line is identiﬁed by using a geometric feature based score function and fed to an OCR engine for character recognition. If the recognition result is conﬁdent enough, the target text line is accepted. Otherwise, all the remaining text lines are fed to the OCR engine to verify whether an alternative target text line exists or the whole image should be rejected. The effectiveness of the above approach is conﬁrmed by experiments on a testing database consisting of 117 document images captured by C-Pen and ScanEye pen scanners.

Zhen-Long Bai, Qiang Huo

Real-time Traffic

Document Analysis | ICDAR 2003 | Possible Target Text | Target Text Line | Text Line |

claim paper

Post Info
More Details (n/a)

Added	04 Jul 2010
Updated	04 Jul 2010
Type	Conference
Year	2003
Where	ICDAR
Authors	Zhen-Long Bai, Qiang Huo

Comments (0)

Sciweavers

An Approach to Extracting the Target Text Line from a Document Image Captured by a Pen Scanner

Document Analysis | ICDAR 2003 | Possible Target Text | Target Text Line | Text Line |

Explore & Download

Productivity Tools

Sciweavers