INTERSPEECH 2010 | Sciweavers

11

INTERSPEECH
2010

122views Signal Processing» more INTERSPEECH 2010»

Continuous speech recognition with a TF-IDF acoustic model

12 years 11 months ago

Information retrieval methods are frequently used for indexing and retrieving spoken documents, and more recently have been proposed for voice-search amongst a pre-defined set of ...

Geoffrey Zweig, Patrick Nguyen, Jasha Droppo, Alex...

claim paper

Read More »

6

click to vote

INTERSPEECH
2010

93views Signal Processing» more INTERSPEECH 2010»

Detecting categorical perception in continuous discrimination data

12 years 11 months ago

Download www.fon.hum.uva.nl

We present a method for assessing categorical perception from continuous discrimination data. Until recently, categorical perception of speech has exclusively been measured by dis...

Paul Boersma, Katerina Chládková

claim paper

Read More »

8

click to vote

INTERSPEECH
2010

114views Signal Processing» more INTERSPEECH 2010»

Phonetic realization of second occurrence focus in Japanese

12 years 11 months ago

Download www.ling.upenn.edu

Previous studies have recently agreed that second occurrence focus is phonetically realized as prosodic prominence. What has been missing in the previous studies, however, is a co...

Satoshi Nambu, Yong-cheol Lee

claim paper

Read More »

8

click to vote

INTERSPEECH
2010

116views Signal Processing» more INTERSPEECH 2010»

Speaking style dependency of formant targets

12 years 11 months ago

Download www.cslu.ogi.edu

Previous work on formant targets has assumed that these targets are independent of the speaking style. In this paper, we estimate consonant and vowel targets in a database of &quo...

Akiko Amano-Kusumoto, John-Paul Hosom, Alexander K...

claim paper

Read More »

10

click to vote

INTERSPEECH
2010

124views Signal Processing» more INTERSPEECH 2010»

Phonetic subspace mixture model for speaker diarization

12 years 11 months ago

Download www.iis.sinica.edu.tw

This paper presents an improved distance measure for speaker clustering in speaker diarization systems. The proposed phonetic subspace mixture (PSM) model introduces phonetic info...

I-Fan Chen, Shih-Sian Cheng, Hsin-Min Wang

claim paper

Read More »

11

click to vote

INTERSPEECH
2010

102views Signal Processing» more INTERSPEECH 2010»

An HMM trajectory tiling (HTT) approach to high quality TTS

12 years 11 months ago

Download festvox.org

We propose an HMM Trajectory Tiling (HTT) approach to high quality TTS, which is our entry to Blizzard Challenge 2010. In HTT, first refined HMM is trained with the Minimum Genera...

Yao Qian, Zhi-Jie Yan, Yijian Wu, Frank K. Soong, ...

claim paper

Read More »

8

click to vote

INTERSPEECH
2010

106views Signal Processing» more INTERSPEECH 2010»

Laryngealization and features for Chinese tonal recognition

12 years 11 months ago

Download www.linguistics.ucla.edu

It is well known that the lowest tone in Mandarin, a language without contrastive phonation, often co-occurs with laryngealization/creaky voice quality, and we provide evidence th...

Kristine M. Yu

claim paper

Read More »

9

click to vote

INTERSPEECH
2010

123views Signal Processing» more INTERSPEECH 2010»

A comparative large scale study of MLP features for Mandarin ASR

12 years 11 months ago

Download www.speech.sri.com

MLP based front-ends have shown significant complementary properties to conventional spectral features. As part of the DARPA GALE program, different MLP features were developed fo...

Fabio Valente, Mathew Magimai-Doss, Christian Plah...

claim paper

Read More »

8

click to vote

INTERSPEECH
2010

93views Signal Processing» more INTERSPEECH 2010»

What do you mean, you're uncertain?: the interpretation of cue words and rising intonation in dialogue

12 years 11 months ago

Download www.ling.upenn.edu

This paper investigates how rising intonation affects the interpretation of cue words in dialogue. Both cue words and rising intonation express a range of speaker attitudes like u...

Catherine Lai

claim paper

Read More »

11

click to vote

INTERSPEECH
2010

117views Signal Processing» more INTERSPEECH 2010»

Reducing musical noise in blind source separation by time-domain sparse filters and split bregman method

12 years 11 months ago

Download math.uci.edu

Musical noise often arises in the outputs of time-frequency binary mask based blind source separation approaches. Postprocessing is desired to enhance the separation quality. An e...

Wenye Ma, Meng Yu, Jack Xin, Stanley Osher

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers