Sciweavers

7 search results - page 2 / 2
» Automatic Processing of Large Corpora for the Resolution of ...
Sort
View
CICLING
2008
Springer
13 years 6 months ago
Non-interactive OCR Post-correction for Giga-Scale Digitization Projects
This paper proposes a non-interactive system for reducing the level of OCR-induced typographical variation in large text collections, contemporary and historical. Text-Induced Corp...
Martin Reynaert
EMNLP
2009
13 years 2 months ago
Sinuhe - Statistical Machine Translation using a Globally Trained Conditional Exponential Family Translation Model
We present a new phrase-based conditional exponential family translation model for statistical machine translation. The model operates on a feature representation in which sentenc...
Matti Kääriäinen