Sciweavers

20 search results - page 1 / 4
» Design and Data Collection for the Accentological Corpus of ...
Sort
View
LREC
2010
178views Education» more  LREC 2010»
13 years 6 months ago
Design and Data Collection for the Accentological Corpus of the Russian Language
Accentological corpus provides a researcher an opportunity to study word stress and stress variation, which are very important for the Russian language. Moreover, Accentological c...
Elena Grishina, Svetlana Savchuk, Alexej Poljakov
LREC
2008
117views Education» more  LREC 2008»
13 years 6 months ago
Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition
This paper discusses the design, recording and preprocessing of a Czech sign language corpus. The corpus is intended for training and testing of sign language recognition (SLR) sy...
Pavel Campr, Marek Hrúz, Jana Trojanov&aacu...
LREC
2010
165views Education» more  LREC 2010»
13 years 6 months ago
Data Collection and IPR in Multilingual Parallel Corpora. Dutch Parallel Corpus
After three years of work the Dutch Parallel Corpus (DPC) project has reached an end. The finalized corpus is a ten-million-word high-quality sentence-aligned bidirectional parall...
Orphée De Clercq, Maribel Montero Perez
EMNLP
2009
13 years 3 months ago
Using the Web for Language Independent Spellchecking and Autocorrection
We have designed, implemented and evaluated an end-to-end system spellchecking and autocorrection system that does not require any manually annotated training data. The World Wide...
Casey Whitelaw, Ben Hutchinson, Grace Chung, Ged E...
LREC
2008
165views Education» more  LREC 2008»
13 years 6 months ago
Design and Data Collection for Spoken Polish Dialogs Database
Spoken corpora provide a critical resource for research, development and evaluation of spoken dialog systems. This paper describes the telephone spoken dialog corpus for Polish cr...
Krzysztof Marasek, Ryszard Gubrynowicz