Sciweavers

» Polyphase speech recognition
INTERSPEECH
2010
14 years 11 months ago
What else is new than the Hamming window? Robust MFCCs for speaker recognition via multitapering
Usually, the mel-frequency cepstral coefficients (MFCCs) are derived from a Hamming-windowed DFT spectrum. In this paper, we advocate using a so-called multitaper method instead. Mul...
Tomi Kinnunen, Rahim Saeidi, Johan Sandberg, Maria...
ICASSP
2011
IEEE
14 years 8 months ago
Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations
We present a study on purely data-based recognition of animal sounds, performing evaluation on a real-world database obtained from the Humboldt-University Animal Sound Archive. As...
Felix Weninger, Björn Schuller
ICASSP
2011
IEEE
14 years 8 months ago
A cochlear neuron based robust feature for speaker recognition
In this paper, a robust feature for text-independent speaker recognition is proposed, which simulates the response mode of cochlear neurons in processing acoustic signals. The featu...
Datao You, Tao Jiang, Jiqing Han, Tieran Zheng
IJVR
2007
15 years 4 months ago
Towards Sociable Virtual Humans: Multimodal Recognition of Human Input and Behavior
One of the biggest obstacles for constructing effective sociable virtual humans lies in the failure of machines to recognize the desires, feelings and intentions of the human us...
Christian Eckes, Konstantin Biatov, Frank Hül...
3DPVT
2004
IEEE
15 years 8 months ago
Speech-Driven Face Synthesis from 3D Video
This paper presents a framework for speech-driven synthesis of real faces from a corpus of 3D video of a person speaking. Video-rate capture of dynamic 3D face shape and colour ap...
Ioannis A. Ypsilos, Adrian Hilton, Aseel Turkmani,...