Sciweavers

2926 search results - page 139 / 586
» Document Analysis
Sort
View
DAS
2010
Springer
15 years 5 months ago
Investigator name recognition from medical journal articles: a comparative study of SVM and structural SVM
Automated extraction of bibliographic information from journal articles is key to the affordable creation and maintenance of citation databases, such as MEDLINE
Xiaoli Zhang, Jie Zou, Daniel X. Le, George R. Tho...
147
Voted
INFOVIS
2005
IEEE
15 years 9 months ago
Turning the Bucket of Text into a Pipe
Many visual analysis tools operate on a fixed set of data. However, professional information analysts follow issues over a period of time and need to be able to easily add new doc...
Elizabeth G. Hetzler, Vernon L. Crow, Deborah A. P...
DIAL
2006
IEEE
130views Image Analysis» more  DIAL 2006»
15 years 7 months ago
Refinement of digitized documents through recognition of mathematical formulae
We are developing a recognition system, named `Infty', for scientific documents including those with mathematical formulae. In this paper, we propose a new system that can re...
Toshihiro Kanahori, Masakazu Suzuki
EMNLP
2010
15 years 1 months ago
Evaluating Models of Latent Document Semantics in the Presence of OCR Errors
Models of latent document semantics such as the mixture of multinomials model and Latent Dirichlet Allocation have received substantial attention for their ability to discover top...
Daniel David Walker, William B. Lund, Eric K. Ring...
99
Voted
ICDAR
2009
IEEE
15 years 10 months ago
Hybrid Page Layout Analysis via Tab-Stop Detection
A new hybrid page layout analysis algorithm is proposed, which uses bottom-up methods to form an initial data-type hypothesis and locate the tab-stops that were used when the page...
Raymond W. Smith