Contextual Information Improves OOV Detection in Speech

15 years 3 months ago

Download www.cs.jhu.edu

Out-of-vocabulary (OOV) words represent an important source of error in large vocabulary continuous speech recognition (LVCSR) systems. These words cause recognition failures, which propagate through pipeline systems impacting the performance of downstream applications. The detection of OOV regions in the output of a LVCSR system is typically addressed as a binary classification task, where each region is independently classified using local information. In this paper, we show that jointly predicting OOV regions, and including contextual information from each region, leads to substantial improvement in OOV detection. Compared to the state-of-the-art, we reduce the missed OOV rate from 42.6% to 28.4% at 10% false alarm rate.

Carolina Parada, Mark Dredze, Denis Filimonov, Fre

Real-time Traffic

Binary Classification Task | Computational Linguistics | False Alarm Rate | NAACL 2010 | OOV Regions |

claim paper

» Recovery of Rare Words in Lecture Speech

» Combination of strongly and weakly constrained recognizers for reliable detection of OOVS

» Confidence estimation OOV detection and language ID using phonetoword transduction and pho...

» EnglishChinese BiDirectional OOV Translation based on Web Mining and Supervised Learning

» Quantifying and Transferring Contextual Information in Object Detection

» Handling OutofVocabulary Words and Recognition Errors Based on Word Linguistic Context for...

» Multimodal new vocabulary recognition through speech and handwriting in a whiteboard sched...

» Joint MorphologicalLexical Language Modeling for Processing Morphologically Rich Languages...

Post Info
More Details (n/a)

Added	14 Feb 2011
Updated	14 Feb 2011
Type	Journal
Year	2010
Where	NAACL
Authors	Carolina Parada, Mark Dredze, Denis Filimonov, Frederick Jelinek

Comments (0)

Sciweavers

Contextual Information Improves OOV Detection in Speech

Binary Classification Task | Computational Linguistics | False Alarm Rate | NAACL 2010 | OOV Regions |

Explore & Download

Productivity Tools

Sciweavers