INTERSPEECH 2010

Learning a language model from continuous speech

This paper presents a new approach to language model construction, learning a language model not from text but directly from continuous speech. A phoneme lattice is created using acoustic model scores, and Bayesian techniques are used to robustly learn a language model from this noisy input. A novel sampling technique is devised that allows for the integrated learning of word boundaries and an n-gram language model with no prior linguistic knowledge. The proposed techniques were used to learn a language model directly from continuous, potentially large-vocabulary speech. This language model significantly reduced the ASR phoneme error rate on a separate set of test data, and the proposed lattice processing and lexical acquisition techniques were found to be important factors in this improvement.
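To make the idea of jointly sampling word boundaries and a lexicon concrete, here is a minimal sketch. It is not the authors' algorithm: it Gibbs-samples boundaries on a single 1-best phoneme string under a unigram Dirichlet-process word model, whereas the paper samples over phoneme lattices with a Bayesian n-gram model. All names and hyperparameters (ALPHA, P_END, gibbs_segment) are illustrative assumptions.

```python
# Toy sketch: joint Gibbs sampling of word boundaries and a unigram lexicon
# from an unsegmented phoneme string, with no prior linguistic knowledge.
import random
from collections import Counter

random.seed(0)

ALPHA = 1.0   # Dirichlet-process concentration (assumed value)
P_END = 0.5   # geometric word-length prior for the base distribution

def base_prob(word, n_phones):
    """P0(word): geometric length prior times uniform choice of each phoneme."""
    return P_END * (1.0 - P_END) ** (len(word) - 1) * (1.0 / n_phones) ** len(word)

def word_prob(word, counts, total, n_phones):
    """CRP-style predictive probability of `word` given current word counts."""
    return (counts[word] + ALPHA * base_prob(word, n_phones)) / (total + ALPHA)

def gibbs_segment(phonemes, n_iters=200):
    """Sample word boundaries for one phoneme string and return the segmentation."""
    n = len(phonemes)
    n_phones = len(set(phonemes))
    # boundaries[j] == True means a word break between phonemes[j] and phonemes[j+1]
    boundaries = [random.random() < 0.5 for _ in range(n - 1)]

    def words():
        out, start = [], 0
        for i, b in enumerate(boundaries, start=1):
            if b:
                out.append(phonemes[start:i])
                start = i
        out.append(phonemes[start:])
        return out

    counts = Counter(words())
    total = sum(counts.values())

    for _ in range(n_iters):
        for i in range(n - 1):
            # word span(s) governed by the candidate boundary at position i
            left = max([j + 1 for j in range(i) if boundaries[j]], default=0)
            right = min([j + 1 for j in range(i + 1, n - 1) if boundaries[j]], default=n)
            w12 = phonemes[left:right]
            w1, w2 = phonemes[left:i + 1], phonemes[i + 1:right]
            # remove the affected word(s) from the counts
            if boundaries[i]:
                counts[w1] -= 1; counts[w2] -= 1; total -= 2
            else:
                counts[w12] -= 1; total -= 1
            # predictive probabilities of the two hypotheses
            p_join = word_prob(w12, counts, total, n_phones)
            p_split = word_prob(w1, counts, total, n_phones)
            counts[w1] += 1                      # w2 is predicted after adding w1
            p_split *= word_prob(w2, counts, total + 1, n_phones)
            counts[w1] -= 1
            # sample the boundary and restore the counts
            boundaries[i] = random.random() < p_split / (p_split + p_join)
            if boundaries[i]:
                counts[w1] += 1; counts[w2] += 1; total += 2
            else:
                counts[w12] += 1; total += 1
    return words()

if __name__ == "__main__":
    # toy "utterance" given as an unsegmented phoneme string
    print(gibbs_segment("iwanttogotothestore"))
```

In the paper's setting, the 1-best string would be replaced by hypotheses drawn from the acoustic-score phoneme lattice and the unigram model by an n-gram language model; the abstract identifies exactly this lattice processing and lexical acquisition as the key factors in the error-rate reduction.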
Type Conference
Year 2010
Where INTERSPEECH
Authors Graham Neubig, Masato Mimura, Shinsuke Mori, Tatsuya Kawahara