Sciweavers

480 search results - page 42 / 96
» Audio segmentation for speech recognition using segment feat...
Sort
View
LREC
2010
256views Education» more  LREC 2010»
15 years 5 months ago
WAPUSK20 - A Database for Robust Audiovisual Speech Recognition
Audiovisual speech recognition (AVSR) systems have been proven superior over audio-only speech recognizers in noisy environments by incorporating features of the visual modality. ...
Alexander Vorwerk, Xiaohui Wang, Dorothea Kolossa,...
ICMCS
2005
IEEE
68views Multimedia» more  ICMCS 2005»
15 years 9 months ago
Playing speech backwards for classification tasks
START SPEEDUP FACTOR [C] FASTER BACKWARDS REPLAY VIA TIME COMPRESSION JUMP WIDTH s j SEGMENT LENGTH s l FASTER BACKWARDS REPLAY VIA SEGMENT DROPPING MODIFIED BACKWARDS REPLAY [B] ...
Wolfgang Hürst, Tobias Lauer, Cedric Bür...
TIFS
2010
127views more  TIFS 2010»
15 years 2 months ago
Audio authenticity: detecting ENF discontinuity with high precision phase analysis
—This paper addresses a forensic tool used to assess audio authenticity. The proposed method is based on detecting phase discontinuity of the power grid signal; this signal, refe...
Daniel Patricio Nicolalde Rodríguez, Jos&ea...
MLMI
2005
Springer
15 years 9 months ago
Multimodal Integration for Meeting Group Action Segmentation and Recognition
We address the problem of segmentation and recognition of sequences of multimodal human interactions in meetings. These interactions can be seen as a rough structure of a meeting, ...
Marc Al-Hames, Alfred Dielmann, Daniel Gatica-Pere...
ECCV
2006
Springer
16 years 6 months ago
TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-class Object Recognition and Segmentation
Abstract. This paper proposes a new approach to learning a discriminative model of object classes, incorporating appearance, shape and context information efficiently. The learned ...
Jamie Shotton, John M. Winn, Carsten Rother, Anton...