Search Sciweavers | Sciweavers


ICASSP
2010
IEEE

152 views · Signal Processing

HMM-based sequence-to-frame mapping for voice conversion

15 years 6 months ago

Voice conversion can be reduced to the problem of finding a transformation function between the corresponding speech sequences of two speakers. Perhaps the most voice conversions meth...

Yu Qiao, Daisuke Saito, Nobuaki Minematsu


SETN
2010
Springer

250 views · Artificial Intelligence

Feature Selection for Improved Phone Duration Modeling of Greek Emotional Speech

16 years 20 days ago

Download www.wcl.ece.upatras.gr

In the present work we address the problem of phone duration modeling for the needs of emotional speech synthesis. Specifically, relying on ten well known machine learning techniqu...

Alexandros Lazaridis, Todor Ganchev, Iosif Mporas,...


INTERSPEECH
2010

122 views · Signal Processing

Building transcribed speech corpora quickly and cheaply for many languages

15 years 20 days ago

Download static.googleusercontent.com

We present a system for quickly and cheaply building transcribed speech corpora containing utterances from many speakers in a variety of acoustic conditions. The system consists o...

Thad Hughes, Kaisuke Nakajima, Linne Ha, Atul Vasu...


INTERSPEECH
2010

123 views · Signal Processing

Measuring basic tempo across languages and some implications for speech rhythm

15 years 20 days ago

Download wwwu.uni-klu.ac.at

Basic language-inherent tempo cannot be isolated by the current metrics of speech rhythm. Here we propose the number of syllables per intonation unit as an appropriate measure, al...

Gertraud Fenk-Oczlon, August Fenk


INTERSPEECH
2010

167 views · Signal Processing

Canonical state models for automatic speech recognition

15 years 20 days ago

Download mi.eng.cam.ac.uk

Current speech recognition systems are often based on HMMs with state-clustered Gaussian Mixture Models (GMMs) to represent the context dependent output distributions. Though high...

Mark J. F. Gales, Kai Yu
