Sciweavers

1261 search results - page 41 / 253
» Extracting Text from PostScript
Sort
View
LWA
2008
14 years 11 months ago
Rule-Based Information Extraction for Structured Data Acquisition using TextMarker
Information extraction is concerned with the location of specific items in (unstructured) textual documents, e.g., being applied for the acquisition of structured data. Then, the ...
Martin Atzmüller, Peter Klügl, Frank Pup...
INLG
2010
Springer
14 years 7 months ago
Extracting Parallel Fragments from Comparable Corpora for Data-to-text Generation
Building NLG systems, in particular statistical ones, requires parallel data (paired inputs and outputs) which do not generally occur naturally. In this paper, we investigate the ...
Anja Belz, Eric Kow
ACL
2009
14 years 7 months ago
Extracting Paraphrases of Technical Terms from Noisy Parallel Software Corpora
In this paper, we study the problem of extracting technical paraphrases from a parallel software corpus, namely, a collection of duplicate bug reports. Paraphrase acquisition is a...
Xiaoyin Wang, David Lo, Jing Jiang, Lu Zhang, Hong...
IRAL
2003
ACM
15 years 3 months ago
A practical text summarizer by paragraph extraction for Thai
In this paper, we propose a practical approach for extracting the most relevant paragraphs from the original document to form a summary for Thai text. The idea of our approach is ...
Chuleerat Jaruskulchai, Canasai Kruengkrai
ACL
2001
14 years 11 months ago
Japanese Information Extraction with Automatically Extracted Patterns
One of the central issues for information extraction (IE) systems is the cost of customization from one scenario to another. Research on the automated acquisition of patterns is i...
Kiyoshi Sudo