Sciweavers

3180 search results - page 143 / 636
» Knowledge-based Document Analysis
Sort
View
217
Voted
DRR
2011
14 years 3 months ago
Improved document image segmentation algorithm using multiresolution morphology
Page segmentation into text and non-text components is an essential preprocessing step before OCR operation. If this is not done properly, an OCR classification engine produces g...
Syed Saqib Bukhari, Faisal Shafait, Thomas M. Breu...
103
Voted
SCIENTOMETRICS
2010
126views more  SCIENTOMETRICS 2010»
15 years 2 months ago
The 12th International conference on scientometrics and informetrics
This paper presents an approach for identifying similar documents that can be used to assist scientists in finding related work. The approach called Citation Proximity Analysis (C...
Jacqueline Leta, Birger Larsen, Ronald Rousseau, W...
142
Voted
CIKM
2010
Springer
15 years 2 months ago
Automatically suggesting topics for augmenting text documents
We present a method for automated topic suggestion. Given a plain-text input document, our algorithm produces a ranking of novel topics that could enrich the input document in a m...
Robert West, Doina Precup, Joelle Pineau
122
Voted
ICDAR
2009
IEEE
15 years 10 months ago
Keyword Spotting in Document Images through Word Shape Coding
With large databases of document images available, a method for users to find keywords in documents will be useful. One approach is to perform Optical Character Recognition (OCR) ...
Shuyong Bai, Linlin Li, Chew Lim Tan
157
Voted
DOCENG
2009
ACM
15 years 7 months ago
Review of automatic document formatting
We review the literature on automatic document formatting with an emphasis on recent work in the field. One common way to frame document formatting is as a constrained optimizatio...
Nathan Hurst, Wilmot Li, Kim Marriott