INTERSPEECH 2010 | Sciweavers

12

INTERSPEECH
2010

123views Signal Processing» more INTERSPEECH 2010»

Can conversational word usage be used to predict speaker demographics?

12 years 11 months ago

This work surveys the potential for predicting demographic traits of individual speakers (gender, age, education level, ethnicity, and geographic region) using only word usage fea...

Dan Gillick

claim paper

Read More »

8

click to vote

INTERSPEECH
2010

93views Signal Processing» more INTERSPEECH 2010»

Automatic discriminative measurement of voice onset time

12 years 11 months ago

Download people.cs.uchicago.edu

Morgan Sonderegger, Joseph Keshet

claim paper

Read More »

11

click to vote

INTERSPEECH
2010

95views Signal Processing» more INTERSPEECH 2010»

Learning speaker normalization using semisupervised manifold alignment

12 years 11 months ago

Download www.tc.umn.edu

As a child acquires language, he or she: perceives acoustic information in his or her surrounding environment; identifies portions of the ambient acoustic information as languager...

Andrew R. Plummer, Mary E. Beckman, Mikhail Belkin...

claim paper

Read More »

12

click to vote

INTERSPEECH
2010

125views Signal Processing» more INTERSPEECH 2010»

What else is new than the hamming window? robust MFCCs for speaker recognition via multitapering

12 years 11 months ago

Download cs.joensuu.fi

Usually the mel-frequency cepstral coefficients (MFCCs) are derived via Hamming windowed DFT spectrum. In this paper, we advocate to use a so-called multitaper method instead. Mul...

Tomi Kinnunen, Rahim Saeidi, Johan Sandberg, Maria...

claim paper

Read More »

13

click to vote

INTERSPEECH
2010

130views Signal Processing» more INTERSPEECH 2010»

HMM adaptation using linear spline interpolation with integrated spline parameter training for robust speech recognition

12 years 11 months ago

Download research.microsoft.com

We recently proposed a method for HMM adaptation to noisy environments called Linear Spline Interpolation (LSI). LSI uses linear spline regression to model the relationship betwee...

Michael L. Seltzer, Alex Acero

claim paper

Read More »

16

click to vote

INTERSPEECH
2010

114views Signal Processing» more INTERSPEECH 2010»

Fully automatic segmentation for prosodic speech corpora

12 years 11 months ago

Download www.tik.ee.ethz.ch

While automatic methods for phonetic segmentation of speech can help with rapid annotation of corpora, most methods rely either on manually segmented data to initially train the p...

Sarah Hoffmann, Beat Pfister

claim paper

Read More »

12

click to vote

INTERSPEECH
2010

99views Signal Processing» more INTERSPEECH 2010»

Acoustic feature analysis in speech emotion primitives estimation

12 years 11 months ago

Download www-scf.usc.edu

We recently proposed a family of robust linear and nonlinear estimation techniques for recognizing the three emotion primitives

Dongrui Wu, Thomas D. Parsons, Shrikanth S. Naraya...

claim paper

Read More »

12

click to vote

INTERSPEECH
2010

127views Signal Processing» more INTERSPEECH 2010»

Setup for acoustic-visual speech synthesis by concatenating bimodal units

12 years 11 months ago

Download hal.archives-ouvertes.fr

This paper presents preliminary work on building a system able to synthesize concurrently the speech signal and a 3D animation of the speaker's face. This is done by concaten...

Asterios Toutios, Utpala Musti, Slim Ouni, Vincent...

claim paper

Read More »

14

click to vote

INTERSPEECH
2010

180views Signal Processing» more INTERSPEECH 2010»

Deep-structured hidden conditional random fields for phonetic recognition

12 years 11 months ago

Download research.microsoft.com

We extend our earlier work on deep-structured conditional random field (DCRF) and develop deep-structured hidden conditional random field (DHCRF). We investigate the use of this n...

Dong Yu, Li Deng

claim paper

Read More »

28

click to vote

INTERSPEECH
2010

242views Signal Processing» more INTERSPEECH 2010»

Recurrent neural network based language model

12 years 11 months ago

Download www.fit.vutbr.cz

A new recurrent neural network based language model (RNN LM) with applications to speech recognition is presented. Results indicate that it is possible to obtain around 50% reduct...

Tomas Mikolov, Martin Karafiát, Lukas Burge...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers