Sciweavers

735 search results - page 41 / 147
» Corpora and data preparation
Sort
View
LREC
2010
152views Education» more  LREC 2010»
14 years 11 months ago
English-Spanish Large Statistical Dictionary of Inflectional Forms
The paper presents an approach for constructing a weighted bilingual dictionary of inflectional forms using as input data a traditional bilingual dictionary, and not parallel corp...
Grigori Sidorov, Alberto Barrón-Cedeñ...
LREC
2008
90views Education» more  LREC 2008»
14 years 11 months ago
Word Alignment Annotation in a Japanese-Chinese Parallel Corpus
Parallel corpora are critical resources for machine translation research and development since parallel corpora contain translation equivalences of various granularities. Manual a...
Yujie Zhang, Zhulong Wang, Kiyotaka Uchimoto, Qing...
EACL
2003
ACL Anthology
14 years 11 months ago
Detecting Novel Compounds: The Role of Distributional Evidence
Research on the discovery of terms from corpora has focused on word sequences whose recurrent occurrence in a corpus is indicative of their terminological status, and has not addr...
Mirella Lapata, Alex Lascarides
GRAPHICSINTERFACE
1997
14 years 11 months ago
Interactive Volumetric Information Visualization
This paper describes a minimallyimmersive threedimensional volumetric interactive information visualizationsystem formanagementand analysis ofdocument corpora. The system, SFA, us...
David S. Ebert, Christopher D. Shaw, Amen Zwa, Eth...
NAACL
2010
14 years 8 months ago
Stream-based Translation Models for Statistical Machine Translation
Typical statistical machine translation systems are trained with static parallel corpora. Here we account for scenarios with a continuous incoming stream of parallel training data...
Abby Levenberg, Chris Callison-Burch, Miles Osborn...