Sciweavers

20 search results - page 2 / 4
» Design and Data Collection for the Accentological Corpus of ...
Sort
View
EMNLP
2009
13 years 3 months ago
Discriminative Corpus Weight Estimation for Machine Translation
Current statistical machine translation (SMT) systems are trained on sentencealigned and word-aligned parallel text collected from various sources. Translation model parameters ar...
Spyros Matsoukas, Antti-Veikko I. Rosti, Bing Zhan...
TBILLC
2005
Springer
13 years 11 months ago
Towards a Cross-Linguistic Production Data Archive: Structure and Exploration
The present paper presents the structure of a cross-linguistic database of production data. The database contains annotated texts collected from a sample of fifteen different langu...
Michael Götze, Stavros Skopeteas, Torsten Rol...
GRAPHICSINTERFACE
2003
13 years 7 months ago
Input-based Language Modelling in the Design of High Performance Text Input Techniques
We present a critique of language-based modelling for text input research, and propose an alternative inputbased approach. Current language-based statistical models are derived fr...
R. William Soukoreff, I. Scott MacKenzie
NAACL
1994
13 years 7 months ago
MACROPHONE: An American English Telephone Speech Corpus
Macrophone is a corpus of approximately 200,000 utterances, recorded over the telephone from a broad sample of about 5,000 American speakers. Sponsored by the Linguistic Data Cons...
Kelsey Taussig, Jared Bernstein
EJC
2008
13 years 7 months ago
Center Fragments for Upscaling and Verification in Database Semantics
The notion of a fragment was coined by Montague 1974 to illustrate the formal handling of certain puzzles, such as de dicto/de re, in a truth-conditional semantics for natural lan...
Roland Hausser