Sciweavers

8 search results - page 1 / 2
» Acquiring Speech Transcriptions Using Mismatched Crowdsourci...
Sort
View
ICASSP
2010
IEEE
13 years 5 months ago
A kernel mean matching approach for environment mismatch compensation in speech recognition
The mismatch between training and test environmental conditions presents a challenge to speech recognition systems. In this paper, we investigate an approach for matching the dist...
Abhishek Kumar, John H. L. Hansen
IJCNLP
2004
Springer
13 years 10 months ago
Detecting Sentence Boundaries in Japanese Speech Transcriptions Using a Morphological Analyzer
We present a method to automatically detect sentence boundaries(SBs) in Japanese speech transcriptions. Our method uses a Japanese morphological analyzer that is based on a cost c...
Sachie Tajima, Hidetsugu Nanba, Manabu Okumura
COGSCI
2002
99views more  COGSCI 2002»
13 years 4 months ago
Learning words from sights and sounds: a computational model
This paper presents an implemented computational model of word acquisition which learns directly from raw multimodal sensory input. Set in an information theoretic framework, the ...
Deb Roy, Alex Pentland
ICASSP
2011
IEEE
12 years 8 months ago
Source-normalised-and-weighted LDA for robust speaker recognition using i-vectors
The recently developed i-vector framework for speaker recognition has set a new performance standard in the research field. An i-vector is a compact representation of a speaker u...
Mitchell McLaren, David A. van Leeuwen
TMM
2010
224views Management» more  TMM 2010»
12 years 11 months ago
A 3-D Audio-Visual Corpus of Affective Communication
Communication between humans deeply relies on the capability of expressing and recognizing feelings. For this reason, research on human-machine interaction needs to focus on the re...
Gabriele Fanelli, Jürgen Gall, Harald Romsdor...