Sciweavers

1261 search results - page 38 / 253
» Extracting Text from PostScript
Sort
View
ACL
1998
14 years 11 months ago
Automatically Creating Bilingual Lexicons for Machine Translation from Bilingual Text
A method is presented for automatically augmenting the bilingual lexicon of an existing Machine Translation system, by extracting bilingual entries from aligned bilingual text. Th...
Davide Turcato
DOCENG
2009
ACM
15 years 4 months ago
Web document text and images extraction using DOM analysis and natural language processing
: © Web Document Text and Images Extraction using DOM Analysis and Natural Language Processing Parag Mulendra Joshi, Sam Liu HP Laboratories HPL-2009-187 Web page text extraction,...
Parag Mulendra Joshi, Sam Liu
DEXAW
2010
IEEE
166views Database» more  DEXAW 2010»
14 years 10 months ago
Thesaurus Based Term Ranking for Keyword Extraction
In many cases keywords from a restricted set of possible keywords have to be assigned to texts. A common way to find the best keywords is to rank terms occurring in the text accord...
Luit Gazendam, Christian Wartena, Rogier Brussee
JAIR
2010
160views more  JAIR 2010»
14 years 8 months ago
Constructing Reference Sets from Unstructured, Ungrammatical Text
Vast amounts of text on the Web are unstructured and ungrammatical, such as classified ads, auction listings, forum postings, etc. We call such text “posts.” Despite their in...
Matthew Michelson, Craig A. Knoblock
EMNLP
2010
14 years 7 months ago
Hierarchical Phrase-Based Translation Grammars Extracted from Alignment Posterior Probabilities
We report on investigations into hierarchical phrase-based translation grammars based on rules extracted from posterior distributions over alignments of the parallel text. Rather ...
Adrià de Gispert, Juan Pino, William J. Byr...