Sciweavers

167 search results - page 25 / 34
» Text Alignment with Handwritten Documents
Sort
View
ACL
2006
14 years 11 months ago
A DOM Tree Alignment Model for Mining Parallel Data from the Web
This paper presents a new web mining scheme for parallel data acquisition. Based on the Document Object Model (DOM), a web page is represented as a DOM tree. Then a DOM tree align...
Lei Shi, Cheng Niu, Ming Zhou, Jianfeng Gao
57
Voted
ANLP
2000
107views more  ANLP 2000»
14 years 10 months ago
Cut and Paste Based Text Summarization
We present a cut and paste based text summarizer, which uses operations derived from an analhuman written abstracts. The summarizer edits extracted sentences, using reduction to r...
Hongyan Jing, Kathleen McKeown
ICDAR
2007
IEEE
15 years 3 months ago
Content-level Annotation of Large Collection of Printed Document Images
A large annotated corpus is critical to the development of robust optical character recognizers (OCRs). However, creation of annotated corpora is a tedious task. It is laborious, ...
Anand Kumar 0002, C. V. Jawahar
COLING
2002
14 years 9 months ago
A Robust Cross-Style Bilingual Sentences Alignment Model
Most current sentence alignment approaches adopt sentence length and cognate as the alignment features; and they are mostly trained and tested in the documents with the same style...
Tz-Liang Kueng, Keh-Yih Su
70
Voted
ICAIL
2009
ACM
15 years 2 months ago
Segmentation of legal documents
An overwhelming number of legal documents is available in digital form. However, most of the texts are usually only provided in a semi-structured form, i.e. the documents are stru...
Eneldo Loza Mencía