Sciweavers

» Corpora and data preparation
COLING
2008
14 years 11 months ago
A Local Alignment Kernel in the Context of NLP
This paper discusses local alignment kernels in the context of the relation extraction task. We define a local alignment kernel based on the Smith-Waterman measure as a sequence s...
Sophia Katrenko, Pieter W. Adriaans
LREC
2010
14 years 11 months ago
BAStat: New Statistical Resources at the Bavarian Archive for Speech Signals
A new type of language resource 'BAStat' has been released by the Bavarian Archive for Speech Signals. In contrast to primary resources like speech and text corpora BASt...
Florian Schiel
LREC
2010
14 years 11 months ago
Enhanced Infrastructure for Creation and Collection of Translation Resources
Statistical Machine Translation (MT) systems have achieved impressive results in recent years, due in large part to the increasing availability of parallel text for system trainin...
Zhiyi Song, Stephanie Strassel, Gary Krug, Kazuaki...
LREC
2010
14 years 11 months ago
Design and Development of Part-of-Speech-Tagging Resources for Wolof (Niger-Congo, spoken in Senegal)
In this paper, we report on the design of a part-of-speech-tagset for Wolof and on the creation of a semi-automatically annotated gold standard. The main motivation for this resou...
Cheikh M. Bamba Dione, Jonas Kuhn, Sina Zarrieß...
EMNLP
2008
14 years 11 months ago
Improved Sentence Alignment on Parallel Web Pages Using a Stochastic Tree Alignment Model
Parallel web pages are an important source of training data for statistical machine translation. In this paper, we present a new approach to sentence alignment on parallel web pages....
Lei Shi, Ming Zhou