Sciweavers

62 search results - page 13 / 13
» A Hybrid Approach to Word Segmentation and POS Tagging
Sort
View
CICLING
2009
Springer
13 years 10 months ago
Language Identification on the Web: Extending the Dictionary Method
Abstract. Automated language identification of written text is a wellestablished research domain that has received considerable attention in the past. By now, efficient and effecti...
Radim Rehurek, Milan Kolkus
ICDAR
1997
IEEE
13 years 10 months ago
Representing OCRed documents in HTML
ABSTRACT: OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...
Tao Hong, Sargur N. Srihari