Sciweavers

18 search results - page 1 / 4
» Combining Visual and Acoustic Speech Signals with a Neural N...
Sort
View
CVIU
2008
118views more  CVIU 2008»
13 years 5 months ago
Multimodal person authentication using speech, face and visual speech
This paper presents a method for automatic multimodal person authentication using speech, face and visual speech modalities. The proposed method uses the motion information to loc...
S. Palanivel, B. Yegnanarayana
ICASSP
2011
IEEE
12 years 9 months ago
The IBM 2009 GALE Arabic speech transcription system
We describe the Arabic broadcast transcription system elded by IBM in the GALE Phase 4 machine translation evaluation. Key advances over our Phase 3.5 system include improvements ...
Brian Kingsbury, Hagen Soltau, George Saon, Stephe...
IAJIT
2010
102views more  IAJIT 2010»
13 years 3 months ago
Multilayer neural network-burg combination for acoustical detection of buried objects
: A Burg technique is employed to model the long wavelength localization and imaging problem. A Burg method is used as a high resolution and stable technique. The idea of in-line h...
Mujahid Al-Azzo, Lubna Badri
ICASSP
2010
IEEE
13 years 5 months ago
The IBM 2008 GALE Arabic speech transcription system
This paper describes the Arabic broadcast transcription system fielded by IBM in the GALE Phase 3.5 machine translation evaluation. Key advances compared to our Phase 2.5 system ...
George Saon, Hagen Soltau, Upendra Chaudhari, Step...