Sciweavers

11 search results - page 2 / 3
» Speech fragment decoding techniques for simultaneous speaker...
Sort
View
TASLP
2010
133views more  TASLP 2010»
12 years 12 months ago
Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments
In the presence of environmental noise, speakers tend to adjust their speech production in an effort to preserve intelligible communication. The noise-induced speech adjustments, c...
Hynek Boril, John H. L. Hansen
ICPR
2010
IEEE
13 years 8 months ago
Crossmodal Matching of Speakers Using Lip and Voice Features in Temporally Non-Overlapping Audio and Video Streams
Person identification using audio (speech) and visual (facial appearance, static or dynamic) modalities, either independently or jointly, is a thoroughly investigated problem in pa...
Anindya Roy, Sebastien Marcel
LREC
2010
256views Education» more  LREC 2010»
13 years 6 months ago
WAPUSK20 - A Database for Robust Audiovisual Speech Recognition
Audiovisual speech recognition (AVSR) systems have been proven superior over audio-only speech recognizers in noisy environments by incorporating features of the visual modality. ...
Alexander Vorwerk, Xiaohui Wang, Dorothea Kolossa,...
ICASSP
2011
IEEE
12 years 8 months ago
NAP for high level language identification
Varying channel conditions present a difficult problem for many speech technologies such as language identification (LID). Channel compensation techniques have been shown to sig...
Fred S. Richardson, William M. Campbell
ICMCS
2005
IEEE
102views Multimedia» more  ICMCS 2005»
13 years 10 months ago
A Probabilistic Framework for TV-News Stories Detection and Classification
In this paper we face the problem of partitioning the news videos into stories, and of their classification according to a predefined set of categories. In particular, we propose ...
Francesco Colace, Pasquale Foggia, Gennaro Percann...