Sciweavers

MLMI
2007
Springer
13 years 11 months ago
Posterior-Based Features and Distances in Template Matching for Speech Recognition
The use of large speech corpora in example-based approaches for speech recognition is mainly focused on increasing the number of examples. This strategy presents some difficulties ...
Guillermo Aradilla, Hervé Bourlard
MLMI
2007
Springer
13 years 11 months ago
Integrating Semantics into Multimodal Interaction Patterns
A user experiment on multimodal interaction (speech, hand position and hand shapes) to study two major relationships: between the level of cognitive load experienced by users and t...
Ronnie Taib, Natalie Ruiz
MLMI
2007
Springer
13 years 11 months ago
Conditional Sequence Model for Context-Based Recognition of Gaze Aversion
Eye gaze and gesture form key conversational grounding cues that are used extensively in face-to-face interaction among people. To accurately recognize visual feedback during inter...
Louis-Philippe Morency, Trevor Darrell
MLMI
2007
Springer
13 years 11 months ago
To Separate Speech
The PASCAL Speech Separation Challenge (SSC) is based on a corpus of sentences from the Wall Street Journal task read by two speakers simultaneously and captured with two circular ...
John W. McDonough, Ken'ichi Kumatani, Tobias Gehri...
MLMI
2007
Springer
13 years 11 months ago
Microphone Array Beamforming Approach to Blind Speech Separation
In this paper, we present a microphone array beamforming approach to blind speech separation. Unlike previous beamforming approaches, our system does not require a-priori knowledge...
Ivan Himawan, Iain McCowan, Mike Lincoln
MLMI
2007
Springer
13 years 11 months ago
Automatic Decision Detection in Meeting Speech
Abstract. Decision making is an important aspect of meetings in organisational settings, and archives of meeting recordings constitute a valuable source of information about the de...
Pei-yun Hsueh, Johanna D. Moore
MLMI
2007
Springer
13 years 11 months ago
A Study of Phoneme and Grapheme Based Context-Dependent ASR Systems
In this paper we present a study of automatic speech recognition systems using context-dependent phonemes and graphemes as sub-word units based on the conventional HMM/GMM system a...
John Dines, Mathew Magimai-Doss
MLMI
2007
Springer
13 years 11 months ago
Automatic Labeling Inconsistencies Detection and Correction for Sentence Unit Segmentation in Conversational Speech
In conversational speech, irregularities in the speech such as overlaps and disruptions make it difficult to decide what is a sentence. Thus, despite very precise guidelines on how...
Sébastien Cuendet, Dilek Z. Hakkani-Tü...
MLMI
2007
Springer
13 years 11 months ago
Gaussian Process Latent Variable Models for Human Pose Estimation
We describe a method for recovering 3D human body pose from silhouettes. Our model is based on learning a latent space using the Gaussian Process Latent Variable Model (GP-LVM) [1]...
Carl Henrik Ek, Philip H. S. Torr, Neil D. Lawrenc...