Sciweavers

735 search results - page 39 / 147
» Corpora and data preparation
COLING
1994
14 years 11 months ago
Machine-Readable Dictionaries in Text-to-Speech Systems
This paper presents the results of an experiment using machine-readable dictionaries (MRDs) and corpora for building concatenative units for text-to-speech (TTS) systems. ...
Judith Klavans, Evelyne Tzoukermann
LREC
2008
14 years 11 months ago
New Resources for Document Classification, Analysis and Translation Technologies
The goal of the DARPA MADCAT (Multilingual Automatic Document Classification Analysis and Translation) Program is to automatically convert foreign language text images into Englis...
Stephanie Strassel, Lauren Friedman, Safa Ismael, ...
LREC
2010
14 years 11 months ago
Active Learning and Crowd-Sourcing for Machine Translation
In recent years, corpus-based approaches to machine translation have become predominant, with Statistical Machine Translation (SMT) being the most actively progressing area. Succe...
Vamshi Ambati, Stephan Vogel, Jaime G. Carbonell
LREC
2010
14 years 11 months ago
Developing a Deep Linguistic Databank Supporting a Collection of Treebanks: the CINTIL DeepGramBank
Corpora of sentences annotated with grammatical information have been deployed by extending the basic lexical and morphological data with increasingly complex information, such as...
António Branco, Francisco Costa, Joã...
NAACL
2007
14 years 11 months ago
Lexicalized Markov Grammars for Sentence Compression
We present a sentence compression system based on synchronous context-free grammars (SCFG), following the successful noisy-channel approach of (Knight and Marcu, 2000). We define...
Michel Galley, Kathleen McKeown