Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

15

ICASSP
2008
IEEE

favoriteEmaildiscussreport

174views Signal Processing» more ICASSP 2008»

Gaze-contingent asr for spontaneous, conversational speech: An evaluation

13 years 11 months ago

Gaze-contingent asr for spontaneous, conversational speech: An evaluation

Download www.eee.bham.ac.uk

There has been little work that attempts to improve the recognition of spontaneous, conversational speech by adding information from a loosely-coupled modality. This study investigated this idea by integrating information from gaze into an ASR system. A probabilistic framework for multimodal recognition was formalised and applied to the speciﬁc case of integrating gaze and speech. Gaze-contingent ASR systems were developed from a baseline ASR system by redistributing language model probability mass according to the visual attention. The best performing systems had similar Word Error Rates to the baseline ASR system and showed an increase in keyword spotting accuracy. The key ﬁnding was that performance improvements observed were due to increased recognition accuracy for words associated with the visual ﬁeld but not the current focus of visual attention.

Neil Cooke, Martin J. Russell

Real-time Traffic

Asr Systems | Baseline Asr | ICASSP 2008 | Signal Processing | Visual Attention |

claim paper

Related Content

» Onesided measures for evaluating ranked retrieval effectiveness with spontaneous conversat...

» Named entity recognition from Conversational Telephone Speech leveraging Word Confusion Ne...

» Emotion recognition from speech Putting ASR in the loop

» Spoken document retrieval from callcenter conversations

» Syllabification of conversational speech using Bidirectional LongShortTerm Memory Neural N...

» Creating conversational interfaces for children

» Using Chunk Based Partial Parsing of Spontaneous Speech in Unrestricted Domains for Reduci...

» Robust Multistream Keyword and Nonlinguistic Vocalization Detection for Computationally In...

» Advances in the CMUInteract Arabic GALE Transcription System

Post Info
More Details (n/a)

Added	30 May 2010
Updated	30 May 2010
Type	Conference
Year	2008
Where	ICASSP
Authors	Neil Cooke, Martin J. Russell

Comments (0)