Sciweavers

301 search results - page 3 / 61
» Audio-Visual Speaker Localization Using Graphical Models
INTERSPEECH
2010
13 years 25 days ago
Adaptation of a tongue shape model by local feature transformations
Reconstructing the full contour of the tongue from the position of 3 to 4 landmarks on it is useful in articulatory speech work. This can be done with submillimetric accuracy usin...
Chao Qin, Miguel Á. Carreira-Perpiñ...
TCSV
2008
13 years 5 months ago
Exploring Co-Occurence Between Speech and Body Movement for Audio-Guided Video Localization
This paper presents a bottom-up approach that combines audio and video to simultaneously locate individual speakers in the video (2-D source localization) and segment their speech ...
H. Vajaria, S. Sarkar, R. Kasturi
TASLP
2008
13 years 6 months ago
Capturing Local Variability for Speaker Normalization in Speech Recognition
The new model reduces the impact of local spectral and temporal variability by estimating a finite set of spectral and temporal warping factors which are applied to speech at the f...
Antonio Miguel, Eduardo Lleida, Richard Rose, Luis...
ISM
2008
IEEE
14 years 12 days ago
Multimodal Speaker Segmentation in Presence of Overlapped Speech Segments
We propose a multimodal speaker segmentation algorithm with two main contributions: First, we suggest a hidden Markov model architecture that performs fusion of the three modaliti...
Viktor Rozgic, Kyu Jeong Han, Panayiotis G. Georgi...
BMVC
2010
13 years 4 months ago
Improved Anatomical Landmark Localization in Medical Images Using Dense Matching of Graphical Models
We propose a method for reliably and accurately identifying anatomical landmarks in 3D CT volumes based on dense matching of parts-based graphical models. Such a system can be use...
Vaclav Potesil, Timor Kadir, Günther Platsch,...