Sciweavers

480 search results - page 19 / 96
» Audio segmentation for speech recognition using segment feat...
Sort
View
ACIVS
2009
Springer
15 years 10 months ago
Two-Level Bimodal Association for Audio-Visual Speech Recognition
This paper proposes a new method for bimodal information fusion in audio-visual speech recognition, where cross-modal association is considered in two levels. First, the acoustic a...
Jong-Seok Lee, Touradj Ebrahimi
IJON
2002
85views more  IJON 2002»
15 years 3 months ago
Learning statistically efficient features for speaker recognition
We apply independent component analysis (ICA) for extracting an optimal basis to the problem of finding efficient features for a speaker. The basis functions learned by the algori...
Gil-Jin Jang, Te-Won Lee, Yung-Hwan Oh
UIST
1992
ACM
15 years 8 months ago
Tools for Building Asynchronous Servers to Support Speech and Audio Applications
Distributed clientisewer models are becoming increasingly prevalent in multimedia systems and advanced user interface design. A multimedia application, for example, may play and r...
Barry Arons
ICAPR
2001
Springer
15 years 8 months ago
A Neural Multi-expert Classification System for MPEG Audio Segmentation
The current research efforts in the field of video parsing and analysis are mainly focused on the use of pictorial information, while neglecting an important supplementary source ...
Massimo De Santo, Gennaro Percannella, Carlo Sanso...
ICPR
2010
IEEE
15 years 7 months ago
Crossmodal Matching of Speakers Using Lip and Voice Features in Temporally Non-Overlapping Audio and Video Streams
Person identification using audio (speech) and visual (facial appearance, static or dynamic) modalities, either independently or jointly, is a thoroughly investigated problem in pa...
Anindya Roy, Sebastien Marcel