Sciweavers

1423 search results - page 212 / 285
» Polyphase speech recognition
Sort
View
ICDE
2006
IEEE
262views Database» more  ICDE 2006»
15 years 10 months ago
The eNTERFACE'05 Audio-Visual Emotion Database
This paper presents an audio-visual emotion database that can be used as a reference database for testing and evaluating video, audio or joint audio-visual emotion recognition alg...
O. Martin, Irene Kotsia, Benoit M. Macq, Ioannis P...
ICMCS
2005
IEEE
123views Multimedia» more  ICMCS 2005»
15 years 10 months ago
Improved face finding in visually challenging environments
Finding faces in visually challenging environments is crucial to many applications, such as audio-visual automatic speech recognition, video indexing, person recognition, and vide...
Jintao Jiang, Gerasimos Potamianos, Giridharan Iye...
ICMCS
2005
IEEE
169views Multimedia» more  ICMCS 2005»
15 years 10 months ago
Dynamic language model adaptation using latent topical information and automatic transcripts
This paper considers dynamic language model adaptation for Mandarin broadcast news recognition. Both contemporary newswire texts and in-domain automatic transcripts were exploited...
Berlin Chen
HRI
2010
ACM
15 years 9 months ago
Recognizing engagement in human-robot interaction
—Based on a study of the engagement process between humans, we have developed and implemented an initial computational model for recognizing engagement between a human and a huma...
Charles Rich, Brett Ponsleur, Aaron Holroyd, Canda...
ICASSP
2009
IEEE
15 years 8 months ago
Neural network based language models for highly inflective languages
Speech recognition of inflectional and morphologically rich languages like Czech is currently quite a challenging task, because simple n-gram techniques are unable to capture impo...
Tomas Mikolov, Jirí Kopecký, Lukas B...