A flat direct model for speech recognition

14 years 4 months ago

Download research.microsoft.com

We introduce a direct model for speech recognition that assumes an unstructured, i.e., ﬂat text output. The ﬂat model allows us to model arbitrary attributes and dependences of the output. This is different from the HMMs typically used for speech recognition. This conventional modeling approach is based on sequential data and makes rigid assumptions on the dependences. HMMs have proven to be convenient and appropriate for large vocabulary continuous speech recognition. Our task under consideration, however, is the Windows Live Search for Mobile (WLS4M) task [1]. This is a cellphone application that allows users to interact with web-based information portals. In particular, the set of valid outputs can be considered discrete and ﬁnite (although probably large, i.e., unseen events are an issue). Hence, a ﬂat direct model lends itself to this task, making the adding of different knowledge sources and dependences straightforward and cheap. Using e.g. HMM posterior, m-gram, and spo...

Georg Heigold, Geoffrey Zweig, Xiao Li, Patrick Ng

Real-time Traffic

Continuous Speech Recognition | Direct Model | ICASSP 2009 | Signal Processing | Speech Recognition |

claim paper

» An overview of textindependent speaker recognition From features to supervectors

» Modified MMIMPE a direct evaluation of the margin in speech recognition

» Robust speech recognition using multiple prior models for speech reconstruction

» Structured discriminative models for noise robust continuous speech recognition

» Discriminative template extraction for direct modeling

» Automatic speech recognition system channel modeling

» Integrated Techniques for Phrase Extraction from Speech

» A Syntactic TimeSeries Model for Parsing Fluent and Disfluent Speech

Post Info
More Details (n/a)

Added	21 May 2010
Updated	21 May 2010
Type	Conference
Year	2009
Where	ICASSP
Authors	Georg Heigold, Geoffrey Zweig, Xiao Li, Patrick Nguyen

Comments (0)

Sciweavers

A flat direct model for speech recognition

Continuous Speech Recognition | Direct Model | ICASSP 2009 | Signal Processing | Speech Recognition |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers