We describe the process of converting plain text cultural heritage data to elements of a domain-specific knowledge base, using general machine learning techniques. First, digitise...
One novel technique for identifying the writer of an online handwritten document is proposed. This technique makes use of a character prototype distribution to model the specific ...
Guo Xian Tan, Christian Viard-Gaudin, Alex ChiChun...
In this paper a morphological tagging approach for document image invoice analysis is described. Tokens close by their morphology and confirmed in their location within different ...
We have combined an artificial neural network (ANN) character classifier with context-driven search over character segmentation, word segmentation, and word recognition hypotheses...
Abstract. Automated language identification of written text is a wellestablished research domain that has received considerable attention in the past. By now, efficient and effecti...