Sciweavers

249 search results - page 38 / 50
» Subspace Gaussian Mixture Models for speech recognition
Sort
View
80
Voted
INTERSPEECH
2010
14 years 4 months ago
Combining five acoustic level modeling methods for automatic speaker age and gender recognition
This paper presents a novel automatic speaker age and gender identification approach which combines five different methods at the acoustic level to improve the baseline performanc...
Ming Li, Chi-Sang Jung, Kyu Jeong Han
72
Voted
ICASSP
2009
IEEE
15 years 4 months ago
On the phonetic information in ultrasonic microphone signals
We study the phonetic information in the signal from an ultrasonic “microphone”, a device that emits an ultrasonic wave toward a speaker and receives the reflected, Doppler-s...
Karen Livescu, Bo Zhu, James R. Glass
85
Voted
ICCV
2003
IEEE
15 years 11 months ago
Recognition of Group Activities using Dynamic Probabilistic Networks
Dynamic Probabilistic Networks (DPNs) are exploited for modelling the temporal relationships among a set of different object temporal events in the scene for a coherent and robust...
Shaogang Gong, Tao Xiang
ICASSP
2010
IEEE
14 years 9 months ago
HMM-based separation of acoustic transfer function for single-channel sound source localization
This paper presents a sound source (talker) localization method using only a single microphone, where a HMM (Hidden Markov Model) of clean speech is introduced to estimate the aco...
Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki
INTERSPEECH
2010
14 years 4 months ago
Recurrent neural network based language model
A new recurrent neural network based language model (RNN LM) with applications to speech recognition is presented. Results indicate that it is possible to obtain around 50% reduct...
Tomas Mikolov, Martin Karafiát, Lukas Burge...