Sciweavers

INTERSPEECH
2010
12 years 11 months ago
Regularized-MLLR speaker adaptation for computer-assisted language learning system
In this paper, we propose a novel speaker adaptation technique, regularized-MLLR, for Computer Assisted Language Learning (CALL) systems. This method uses a linear combination of ...
Dean Luo, Yu Qiao, Nobuaki Minematsu, Yutaka Yamau...
INTERSPEECH
2010
12 years 11 months ago
Data-dependent evaluator modeling and its application to emotional valence classification from speech
Practical supervised learning scenarios involving subjectively evaluated data have multiple evaluators, each giving their noisy version of the hidden ground truth. Majority logic ...
Kartik Audhkhasi, Shrikanth S. Narayanan
INTERSPEECH
2010
12 years 11 months ago
Automatic selection of thresholds for signal separation algorithms based on interaural delay
In this paper we describe a system that separates signals by comparing the interaural time delays (ITDs) of their timefrequency components to a fixed threshold ITD. While in previ...
Chanwoo Kim, Richard M. Stern, Kiwan Eom, Jaewon L...
INTERSPEECH
2010
12 years 11 months ago
Search by voice in Mandarin Chinese
In this paper we describe our efforts to build a Mandarin Chinese voice search system. We describe our strategies for data collection, language, lexicon and acoustic modeling, as ...
Jiulong Shan, Genqing Wu, Zhihong Hu, Xiliu Tang, ...
INTERSPEECH
2010
12 years 11 months ago
Shape-invariant speech transformation with the phase vocoder
This paper proposes a new phase vocoder based method for shape invariant real-time modification of speech signals. The performance of the method with respect voiced and unvoiced s...
Axel Röbel
INTERSPEECH
2010
12 years 11 months ago
Semi-supervised extractive speech summarization via co-training algorithm
Supervised methods for extractive speech summarization require a large training set. Summary annotation is often expensive and time consuming. In this paper, we exploit semisuperv...
Shasha Xie, Hui Lin, Yang Liu
INTERSPEECH
2010
12 years 11 months ago
Mask estimation in non-stationary noise environments for missing feature based robust speech recognition
In missing feature based automatic speech recognition (ASR), the role of the spectro-temporal mask in providing an accurate description of the relationship between target speech a...
Shirin Badiezadegan, Richard C. Rose
INTERSPEECH
2010
12 years 11 months ago
A spoken term detection framework for recovering out-of-vocabulary words using the web
Vocabulary restrictions in large vocabulary continuous speech recognition (LVCSR) systems mean that out-of-vocabulary (OOV) words are lost in the output. However, OOV words tend t...
Carolina Parada, Abhinav Sethy, Mark Dredze, Frede...
INTERSPEECH
2010
12 years 11 months ago
Frequency of occurrence effects on pitch accent realisation
Katrin Schweitzer, Michael Walsh, Bernd Möbiu...