Continuous speech recognition with a TF-IDF acoustic model


Information retrieval methods are frequently used for indexing and retrieving spoken documents, and more recently have been proposed for voice search among a pre-defined set of business entries. In this paper, we show that these methods can be used in an even more fundamental way, as the core component in a continuous speech recognizer. Speech is initially processed and represented as a sequence of discrete symbols, specifically phoneme or multi-phone units. Recognition then operates on this sequence. The recognizer is segment-based, and the acoustic score for labeling a segment with a word is based on the TF-IDF similarity between the sub-word units detected in the segment and those typically seen in association with the word. We present promising results on both a voice search task and the Wall Street Journal task. The development of this method brings us one step closer to being able to do speech recognition based on the detection of sub-word audio attributes.
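The segment scoring described in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's exact term weighting or normalization, so everything below is an assumption: the units are phoneme unigrams and bigrams, the IDF is a common smoothed variant, and the similarity is plain cosine. The names `ngrams`, `build_model`, and `score_segment` are hypothetical and do not come from the paper.

```python
import math
from collections import Counter


def ngrams(units, n_max=2):
    """Return all unit n-grams of length 1..n_max as tuples."""
    return [tuple(units[i:i + n])
            for n in range(1, n_max + 1)
            for i in range(len(units) - n + 1)]


def build_model(training):
    """Build TF-IDF vectors, treating each word as one 'document'.

    training maps a word to the list of phoneme sequences observed with it.
    The smoothed IDF used here is an assumption, not the paper's formula.
    """
    tf = {w: Counter(g for seq in seqs for g in ngrams(seq))
          for w, seqs in training.items()}
    df = Counter(g for counts in tf.values() for g in counts)
    n_words = len(tf)
    idf = {g: math.log((1 + n_words) / (1 + d)) + 1.0 for g, d in df.items()}
    vectors = {w: {g: c * idf[g] for g, c in counts.items()}
               for w, counts in tf.items()}
    return idf, vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def score_segment(segment_units, idf, vectors):
    """Score every word against the units detected in one segment."""
    counts = Counter(ngrams(segment_units))
    # Units never seen in training have no IDF and are ignored.
    seg = {g: c * idf[g] for g, c in counts.items() if g in idf}
    return {w: cosine(seg, vec) for w, vec in vectors.items()}


# Toy data, invented for illustration only.
training = {
    "cat": [["k", "ae", "t"]],
    "cab": [["k", "ae", "b"]],
    "dog": [["d", "ao", "g"]],
}
idf, vectors = build_model(training)
scores = score_segment(["k", "ae", "t"], idf, vectors)
best_word = max(scores, key=scores.get)
```

In a segment-based recognizer, scores like these would be computed for each candidate segment and combined with a language model during search; that search is outside the scope of this sketch.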

Geoffrey Zweig, Patrick Nguyen, Jasha Droppo, Alex Acero


Continuous Speech Recognizer | Information Retrieval Methods | INTERSPEECH 2010 | Signal Processing | Sub-word Audio Attributes


» Multiview and multiobjective semisupervised learning for large vocabulary continuous speec...

» An investigation of subspace modeling for phonetic and speaker variability in automatic sp...

» Discriminative training of hierarchical acoustic models for large vocabulary continuous sp...

» Early recognition of polysyllabic words in continuous speech

» Automatic Speech Recognition for Polish in a Computer Game Interface

» Machine and acoustical condition dependency analyses for fast acoustic likelihood calculat...

» A discriminative splitting criterion for phonetic decision trees

» Error Approximation and Minimum Phone Error Acoustic Model Estimation

Post Info

Added	18 May 2011
Updated	18 May 2011
Type	Journal
Year	2010
Where	INTERSPEECH
Authors	Geoffrey Zweig, Patrick Nguyen, Jasha Droppo, Alex Acero


Sciweavers
