Sciweavers

3693 search results - page 143 / 739
» Network Processing of Documents, for Documents, by Documents
Sort
View
ICDAR
1995
IEEE
15 years 3 months ago
A Hough based algorithm for extracting text lines in handwritten documents
The method herein proposed detects text lines on handwritten pages which may include either lines oriented in several directions, erasures, or annotationsbetween main lines. The m...
Laurence Likforman-Sulem, Anahid Hanimyan, Claudie...
IJMMS
2007
107views more  IJMMS 2007»
14 years 11 months ago
Ontologies as facilitators for repurposing web documents
This paper investigates the role of ontologies as a central part of an architecture to repurpose existing material from the web. A prototype system called ArtEquAKT is presented, ...
Mark J. Weal, Harith Alani, Sanghee Kim, Paul H. L...
CIKM
2011
Springer
13 years 11 months ago
Towards noise-resilient document modeling
We introduce a generative probabilistic document model based on latent Dirichlet allocation (LDA), to deal with textual errors in the document collection. Our model is inspired by...
Tao Yang, Dongwon Lee
ICPR
2002
IEEE
16 years 24 days ago
Robust Text Detection from Binarized Document Images
Many document images are rich in color and have complex background. To detect text from them, a standard approach utilizes both color and binary information. This often leads to t...
Oleg Okun, Yu Yan, Matti Pietikäinen
WWW
2007
ACM
16 years 12 days ago
Altering document term vectors for classification: ontologies as expectations of co-occurrence
In this paper we extend the state-of-the-art in utilizing background knowledge for supervised classification by exploiting the semantic relationships between terms explicated in O...
Meenakshi Nagarajan, Amit P. Sheth, Marcos Kawazoe...