Sciweavers

1261 search results - page 165 / 253
» Extracting Text from PostScript
BMCBI
2008
14 years 10 months ago
Abbreviation definition identification based on automatic precision estimates
Background: The rapid growth of biomedical literature presents challenges for automatic text processing, and one of the challenges is abbreviation identification. The presence of ...
Sunghwan Sohn, Donald C. Comeau, Won Kim, W. John ...
ICASSP
2011
IEEE
14 years 1 months ago
Belief theoretic methods for soft and hard data fusion
In many contexts, one is confronted with the problem of extracting information from large amounts of different types of soft data (e.g., text) and hard data (from e.g., physics-based...
Thanuka Wickramarathne, Kamal Premaratne, Manohar ...
ICDAR
2007
IEEE
15 years 4 months ago
Robust Document Warping with Interpolated Vector Fields
This paper describes a new versatile algorithm for correcting nonlinear distortions, such as curvature of book pages, in camera based document processing. We introduce the idea of...
D. Schneider, Marco Block, Raúl Rojas
CAIP
2007
Springer
15 years 1 months ago
An Efficient Method for Filtering Image-Based Spam E-mail
Spam e-mail with advertisement text embedded in images presents a great challenge to anti-spam filters. In this paper, we present a fast method to detect image-based spam e-mail. U...
Ngo Phuong Nhung, Tu Minh Phuong
EMNLP
2004
14 years 11 months ago
Trained Named Entity Recognition using Distributional Clusters
This work applies boosted wrapper induction (BWI), a machine learning algorithm for information extraction from semi-structured documents, to the problem of named entity recogniti...
Dayne Freitag