Sciweavers

5 search results - page 1 / 1
» Joint encoding of the waveform and speech recognition featur...
Sort
View
ICASSP
2011
IEEE
12 years 8 months ago
Joint encoding of the waveform and speech recognition features using a transform codec
We propose a new transform speech codec that jointly encodes a wideband waveform and its corresponding wideband and narrowband speech recognition features. For distributed speech ...
Xing Fan, Michael L. Seltzer, Jasha Droppo, Henriq...
ICASSP
2010
IEEE
13 years 3 months ago
Simple methods for improving speaker-similarity of HMM-based speech synthesis
In this paper we revisit some basic configuration choices of HMMbased speech synthesis, such as waveform sampling rate, auditory frequency warping scale and the logarithmic scali...
Junichi Yamagishi, Simon King
ICASSP
2010
IEEE
13 years 3 months ago
A comparison of approaches for modeling prosodic features in speaker recognition
Prosodic information has been successfully used for speaker recognition for more than a decade. The best-performing prosodic system to date has been one based on features extracte...
Luciana Ferrer, Nicolas Scheffer, Elizabeth Shribe...
TASLP
2008
131views more  TASLP 2008»
13 years 4 months ago
Histogram-Based Quantization for Robust and/or Distributed Speech Recognition
Abstract--In a distributed speech recognition (DSR) framework, the speech features are quantized and compressed at the client and recognized at the server. However, recognition acc...
Chia-Yu Wan, Lin-Shan Lee
INTERSPEECH
2010
12 years 11 months ago
Robust automatic speech recognition with decoder oriented ideal binary mask estimation
In this paper, we propose a joint optimal method for automatic speech recognition (ASR) and ideal binary mask (IBM) estimation in transformed into the cepstral domain through a ne...
Lae-Hoon Kim, Kyung-Tae Kim, Mark Hasegawa-Johnson