Sciweavers

735 search results - page 42 / 147
» Corpora and data preparation
EMNLP
2009
14 years 7 months ago
Improved Statistical Machine Translation Using Monolingually-Derived Paraphrases
Untranslated words still constitute a major problem for Statistical Machine Translation (SMT), and current SMT systems are limited by the quantity of parallel training texts. Augm...
Yuval Marton, Chris Callison-Burch, Philip Resnik
COLING
2010
14 years 5 months ago
LTP: A Chinese Language Technology Platform
LTP (Language Technology Platform) is an integrated Chinese processing platform which includes a suite of high performance natural language processing (NLP) modules and relevant c...
Wanxiang Che, Zhenghua Li, Ting Liu
DOCENG
2007
ACM
15 years 1 months ago
Extracting reusable document components for variable data printing
Variable Data Printing (VDP) has brought new flexibility and dynamism to the printed page. Each printed instance of a specific class of document can now have different degrees of ...
Steven R. Bagley, David F. Brailsford, James A. Ol...
SCAM
2008
IEEE
15 years 4 months ago
CoordInspector: A Tool for Extracting Coordination Data from Legacy Code
More and more current software systems rely on non-trivial coordination logic for combining autonomous services typically running on different platforms and often owned by diffe...
Nuno F. Rodrigues, Luís Soares Barbosa
BTW
2003
Springer
15 years 3 months ago
Comparative Evaluation of Microarray-based Gene Expression Databases
Microarrays make it possible to monitor the expression of thousands of genes in parallel thus generating huge amounts of data. So far, several databases have been developed for man...
Hong Hai Do, Toralf Kirsten, Erhard Rahm