Sciweavers

ICASSP
2008
IEEE

Gaze-contingent asr for spontaneous, conversational speech: An evaluation

13 years 11 months ago
Gaze-contingent asr for spontaneous, conversational speech: An evaluation
There has been little work that attempts to improve the recognition of spontaneous, conversational speech by adding information from a loosely-coupled modality. This study investigated this idea by integrating information from gaze into an ASR system. A probabilistic framework for multimodal recognition was formalised and applied to the specific case of integrating gaze and speech. Gaze-contingent ASR systems were developed from a baseline ASR system by redistributing language model probability mass according to the visual attention. The best performing systems had similar Word Error Rates to the baseline ASR system and showed an increase in keyword spotting accuracy. The key finding was that performance improvements observed were due to increased recognition accuracy for words associated with the visual field but not the current focus of visual attention.
Neil Cooke, Martin J. Russell
Added 30 May 2010
Updated 30 May 2010
Type Conference
Year 2008
Where ICASSP
Authors Neil Cooke, Martin J. Russell
Comments (0)