Sciweavers

INTERSPEECH
2010
12 years 11 months ago
Topic and style-adapted language modeling for Thai broadcast news ASR
The amount of available Thai broadcast news transcribed text for training a language model is still very limited, comparing to other major languages. Since the construction of a b...
Markpong Jongtaveesataporn, Sadaoki Furui
INTERSPEECH
2010
12 years 11 months ago
SCARF: a segmental conditional random field toolkit for speech recognition
This paper describes a new toolkit - SCARF - for doing speech recognition with segmental conditional random fields. It is designed to allow for the integration of numerous, possib...
Geoffrey Zweig, Patrick Nguyen
INTERSPEECH
2010
12 years 11 months ago
Learning a language model from continuous speech
This paper presents a new approach to language model construction, learning a language model not from text, but directly from continuous speech. A phoneme lattice is created using...
Graham Neubig, Masato Mimura, Shinsuke Mori, Tatsu...
COLING
2010
12 years 11 months ago
Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets
An unsupervised discriminative training procedure is proposed for estimating a language model (LM) for machine translation (MT). An English-to-English synchronous context-free gra...
Zhifei Li, Ziyuan Wang, Sanjeev Khudanpur, Jason E...
COLING
2010
12 years 11 months ago
Local lexical adaptation in Machine Translation through triangulation: SMT helping SMT
We present a framework where auxiliary MT systems are used to provide lexical predictions to a main SMT system. In this work, predictions are obtained by means of pivoting via aux...
Josep Maria Crego, Aurélien Max, Fran&ccedi...
ICDAR
2009
IEEE
13 years 2 months ago
Language Model Integration for the Recognition of Handwritten Medieval Documents
Building recognition systems for historical documents is a difficult task. Especially, when it comes to medieval scripts. The complexity is mainly affected by the poor quality and...
Markus Wüthrich, Marcus Liwicki, Andreas Fisc...
ICDAR
2009
IEEE
13 years 2 months ago
Handling Out-of-Vocabulary Words and Recognition Errors Based on Word Linguistic Context for Handwritten Sentence Recognition
In this paper we investigate the use of linguistic information given by language models to deal with word recognition errors on handwritten sentences. We focus especially on error...
Solen Quiniou, Mohamed Cheriet, Éric Anquet...
EMNLP
2009
13 years 2 months ago
Semi-supervised Semantic Role Labeling Using the Latent Words Language Model
Semantic Role Labeling (SRL) has proved to be a valuable tool for performing automatic analysis of natural language texts. Currently however, most systems rely on a large training...
Koen Deschacht, Marie-Francine Moens
EMNLP
2009
13 years 2 months ago
Matching Reviews to Objects using a Language Model
We develop a general method to match unstructured text reviews to a structured list of objects. For this, we propose a language model for generating reviews that incorporates a de...
Nilesh N. Dalvi, Ravi Kumar, Bo Pang, Andrew Tomki...
EMNLP
2009
13 years 2 months ago
Language Models Based on Semantic Composition
In this paper we propose a novel statistical language model to capture long-range semantic dependencies. Specifically, we apply the concept of semantic composition to the problem ...
Jeff Mitchell, Mirella Lapata