Most text categorization methods require text content of documents that is often difficult to obtain. We consider "Collaborative Text Categorization", where each document...
In this paper we introduce a machine learning approach for automatic text segmentation. Our text segmenter clusters text-segments containing similar concepts. It first discovers th...
We propose a novel approach for finding text in images by using ridges at several scales. A text string is modelled by a ridge at a coarse scale representing its center line and n...
We propose a method for constructing a vector for a document image to represent its content to facilitate text retrieval. The method is based on an N-Gram algorithm for text simil...
Abstract. Text interpretation can be considered as the process of extracting deep-level semantics from unstructured text documents. Deeplevel semantics represent abstract index str...
Irma Sofia Espinosa Peraldi, Atila Kaya, Sylvia Me...