Sciweavers

76 search results - page 15 / 16
» Pseudo-Aligned Multilingual Corpora
CICLING
2008
Springer
13 years 7 months ago
Portuguese Pronoun Resolution: Resources and Evaluation
Despite being one of the most widely-spoken languages in the world, Portuguese remains a relatively resource-poor language, for which only in recent years NLP tools such as parse...
Ramon Ré Moya Cuevas, Willian Yukio Honda, ...
AAAI
2008
13 years 7 months ago
Cross-lingual Propagation for Morphological Analysis
Multilingual parallel text corpora provide a powerful means for propagating linguistic knowledge across languages. We present a model which jointly learns linguistic structure for...
Benjamin Snyder, Regina Barzilay
LREC
2008
13 years 6 months ago
New Resources for Document Classification, Analysis and Translation Technologies
The goal of the DARPA MADCAT (Multilingual Automatic Document Classification Analysis and Translation) Program is to automatically convert foreign language text images into Englis...
Stephanie Strassel, Lauren Friedman, Safa Ismael, ...
LREC
2008
13 years 6 months ago
Improving Statistical Machine Translation Efficiency by Triangulation
In current phrase-based Statistical Machine Translation systems, more training data is generally better than less. However, a larger data set eventually introduces a larger model ...
Yu Chen, Andreas Eisele, Martin Kay
EMNLP
2007
13 years 6 months ago
Bootstrapping Feature-Rich Dependency Parsers with Entropic Priors
One may need to build a statistical parser for a new language, using only a very small labeled treebank together with raw text. We argue that bootstrapping a parser is most promis...
David A. Smith, Jason Eisner