Sciweavers

Knowledge-based Document Analysis
DAS
2010
Springer
15 years 5 months ago
Overlapped text segmentation using Markov random field and aggregation
Separating machine printed text and handwriting from overlapping text is a challenging problem in the document analysis field and no reliable algorithms have been developed thus f...
Xujun Peng, Srirangaraj Setlur, Venu Govindaraju, ...
ICDAR
2011
IEEE
14 years 3 months ago
Embedding a Mathematical OCR Module into OCRopus
This paper describes embedding a mathematical formula recognition module into the OCR system OCRopus, aiming at developing an OCR system for scientific and technical documents wh...
Shinpei Yamazaki, Fumihiro Furukori, Qinzheng Zhao...
IJCNLP
2004
Springer
15 years 9 months ago
Combining Labeled and Unlabeled Data for Learning Cross-Document Structural Relationships
Multi-document discourse analysis has emerged with the potential of improving various NLP applications. Based on the newly proposed Cross-document Structure Theory (CST), this pap...
Zhu Zhang, Dragomir R. Radev
IJCAI
2001
15 years 5 months ago
Combining Statistics and Semantics for Word and Document Clustering
A new approach for constructing pseudo-keywords, referred to as Sense Units, is proposed. Sense Units are obtained by a word clustering process, where the underlying similarity re...
Alexandre Termier, Michèle Sebag, Marie-Chr...
ANLP
1994
15 years 5 months ago
Practical Issues in Automatic Documentation Generation
PLANDoc, a system under joint development by Columbia and Bellcore, documents the activity of planning engineers as they study telephone routes. It takes as input a trace of the e...
Kathleen McKeown, Karen Kukich, James Shaw