INTERSPEECH 2010 | Sciweavers

INTERSPEECH
2010

Signal Processing

Chirp complex cepstrum-based decomposition for asynchronous glottal analysis

12 years 11 months ago

It was recently shown that complex cepstrum can be effectively used for glottal flow estimation by separating the causal and anticausal components of speech. In order to guarantee...

Thomas Drugman, Thierry Dutoit

Morphological and predictability effects on schwa reduction: the case of Dutch word-initial syllables

12 years 11 months ago

Download pubman.mpdl.mpg.de

This corpus-based study shows that the presence and duration of schwa in Dutch word-initial syllables are affected by a word's predictability and its morphological structure....

Iris Hanique, Barbara Schuppler, Mirjam Ernestus

Towards mixed language speech recognition systems

12 years 11 months ago

Download infoscience.epfl.ch

Multilingual speech recognition obviously involves numerous research challenges, including common phoneme sets, adaptation on limited amount of training data, as well as mixed lan...

David Imseng, Hervé Bourlard, Mathew Magima...

Towards spoken term discovery at scale with zero resources

12 years 11 months ago

Download www.clsp.jhu.edu

Aren Jansen, Kenneth Church, Hynek Hermansky

Hierarchical bottle neck features for LVCSR

12 years 11 months ago

Download www-i6.informatik.rwth-aachen.de

This paper investigates the combination of different neural network topologies for probabilistic feature extraction. On one hand, a five-layer neural network used in bottle neck f...

Christian Plahl, Ralf Schlüter, Hermann Ney

Revisiting VTLN using linear transformation on conventional MFCC

12 years 11 months ago

Download www-i6.informatik.rwth-aachen.de

In this paper, we revisit the linear transformation for VTLN on conventional MFCC proposed by Sanand et al. in [1], using the idea of band-limited interpolation. The filter-bank i...

Doddipatla Rama Sanand, Ralf Schlüter, Herman...

HMM-based automatic visual speech segmentation using facial data

12 years 11 months ago

Download hal.inria.fr

We describe automatic visual speech segmentation using facial data captured by a stereo-vision technique. The segmentation is performed using an HMM-based forced alignment mechani...

Utpala Musti, Asterios Toutios, Slim Ouni, Vincent...

Comparison of approaches for instrumentally predicting the quality of text-to-speech systems

12 years 11 months ago

Download individual.utoronto.ca

In this paper, we compare and combine different approaches for instrumentally predicting the perceived quality of Text-to-Speech systems. First, a log-likelihood is determined by ...

Sebastian Möller, Florian Hinterleitner, Tiag...

Decision tree state clustering with word and syllable features

12 years 11 months ago

Download static.googleusercontent.com

In large vocabulary continuous speech recognition, decision trees are widely used to cluster triphone states. In addition to commonly used phonetically based questions, others hav...

Hank Liao, Christopher Alberti, Michiel Bacchiani,...

Measuring basic tempo across languages and some implications for speech rhythm

12 years 11 months ago

Download wwwu.uni-klu.ac.at

Basic language-inherent tempo cannot be isolated by the current metrics of speech rhythm. Here we propose the number of syllables per intonation unit as an appropriate measure, al...

Gertraud Fenk-Oczlon, August Fenk
