INTERSPEECH 2010 | Sciweavers

12

INTERSPEECH
2010

116views Signal Processing» more INTERSPEECH 2010»

Glottal-based analysis of the lombard effect

12 years 11 months ago

The Lombard effect refers to the speech changes due to the immersion of the speaker in a noisy environment. Among these changes, studies have already reported acoustic modificatio...

Thomas Drugman, Thierry Dutoit

claim paper

Read More »

11

click to vote

INTERSPEECH
2010

140views Signal Processing» more INTERSPEECH 2010»

An improved wavelet-based dereverberation for robust automatic speech recognition

12 years 11 months ago

Download www.ar.media.kyoto-u.ac.jp

This paper presents an improved wavelet-based dereverberation method for automatic speech recognition (ASR). Dereverberation is based on filtering reverberant wavelet coefficients...

Randy Gomez, Tatsuya Kawahara

claim paper

Read More »

14

click to vote

INTERSPEECH
2010

138views Signal Processing» more INTERSPEECH 2010»

Discriminative adaptation for log-linear acoustic models

12 years 11 months ago

Download www-i6.informatik.rwth-aachen.de

Log-linear models have recently been used in acoustic modeling for speech recognition systems. This has been motivated by competitive results compared to systems based on Gaussian...

Jonas Lööf, Ralf Schlüter, Hermann ...

claim paper

Read More »

14

click to vote

INTERSPEECH
2010

90views Signal Processing» more INTERSPEECH 2010»

12 years 11 months ago

Pitch similarity in the vicinity of backchannels

Download www.cs.columbia.edu

Dynamic modeling of spoken dialogue seeks to capture how interlocutors change their speech over the course of a conversation. Much work has focused on how speakers adapt or entrai...

Mattias Heldner, Jens Edlund, Julia Hirschberg

claim paper

Read More »

14

click to vote

INTERSPEECH
2010

121views Signal Processing» more INTERSPEECH 2010»

Learning from human errors: prediction of phoneme confusions based on modified ASR training

12 years 11 months ago

Download medi.uni-oldenburg.de

In an attempt to improve models of human perception, the recognition of phonemes in nonsense utterances was predicted with automatic speech recognition (ASR) in order to analyze i...

Bernd T. Meyer, Birger Kollmeier

claim paper

Read More »

12

click to vote

INTERSPEECH
2010

118views Signal Processing» more INTERSPEECH 2010»

Expectations for discourse genre identification: a prosodic study

12 years 11 months ago

Download articles.ircam.fr

Speech can be divided into discourse genres based on the contextual environment it occurs in (e.g. political speech, sport commentary speech, etc.). The present study investigated...

Nicolas Obin, Volker Dellwo, Anne Lacheret, Xavier...

claim paper

Read More »

10

click to vote

INTERSPEECH
2010

100views Signal Processing» more INTERSPEECH 2010»

Improved neural network based language modelling and adaptation

12 years 11 months ago

Download mi.eng.cam.ac.uk

Neural network language models (NNLM) have become an increasingly popular choice for large vocabulary continuous speech recognition (LVCSR) tasks, due to their inherent generalisa...

Junho Park, Xunying Liu, Mark J. F. Gales, Philip ...

claim paper

Read More »

7

click to vote

INTERSPEECH
2010

122views Signal Processing» more INTERSPEECH 2010»

Building transcribed speech corpora quickly and cheaply for many languages

12 years 11 months ago

Download static.googleusercontent.com

We present a system for quickly and cheaply building transcribed speech corpora containing utterances from many speakers in a variety of acoustic conditions. The system consists o...

Thad Hughes, Kaisuke Nakajima, Linne Ha, Atul Vasu...

claim paper

Read More »

10

click to vote

INTERSPEECH
2010

109views Signal Processing» more INTERSPEECH 2010»

Distribution and trichotomic realization of voiced velars in Japanese - an experimental study

12 years 11 months ago

Download www.geocities.jp

In this paper, we demonstrate the trichotomic realization of voiced velars in Japanese, challenging the traditional plosive/nasal dichotomy of velar allophones, and examine the di...

Shin-ichiro Sano, Tomohiko Ooigawa

claim paper

Read More »

13

click to vote

INTERSPEECH
2010

153views Signal Processing» more INTERSPEECH 2010»

Acoustic vector resampling for GMMSVM-based speaker verification

12 years 11 months ago

Download www.eie.polyu.edu.hk

Using GMM-supervectors as the input to SVM classifiers (namely, GMM-SVM) is one of the promising approaches to text-independent speaker verification. However, one unaddressed issu...

Man-Wai Mak, Wei Rao

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers