Sciweavers

INTERSPEECH
2010
12 years 11 months ago
Vocabulary independent spoken query: a case for subword units
In this work, we describe a subword unit approach for information retrieval of items by voice. An algorithm based on the minimum description length (MDL) principle converts an ind...
Evandro B. Gouvêa, Tony Ezzat
INTERSPEECH
2010
12 years 11 months ago
Prominence detection in Swedish using syllable correlates
This paper presents an approach to estimating word level prominence in Swedish using syllable level features. The paper discusses the mismatch problem of annotations between word ...
Samer Al Moubayed, Jonas Beskow
INTERSPEECH
2010
12 years 11 months ago
An intonation model for TTS in sepedi
We present an initial investigation into the acoustic realisation of tone in continuous utterances in Sepedi (a language in the Southern Bantu family). An analytic model for the g...
Daniel R. van Niekerk, Etienne Barnard
INTERSPEECH
2010
12 years 11 months ago
Channel detectors for system fusion in the context of NIST LRE 2009
One of the difficulties in Language Recognition is the variability of the speech signal due to speakers and channels. If channel mismatch is too big and when different categories ...
Florian Verdet, Driss Matrouf, Jean-Françoi...
INTERSPEECH
2010
12 years 11 months ago
Text-based unstressed syllable prediction in Mandarin
Recently, an increasing attention has been paid to Mandarin word stress which is important for improving the naturalness of speech synthesis. Most of the research on Mandarin spee...
Ya Li, Jianhua Tao, Meng Zhang, Shifeng Pan, Xiaoy...
INTERSPEECH
2010
12 years 11 months ago
A discriminative splitting criterion for phonetic decision trees
Phonetic decision trees are a key concept in acoustic modeling for large vocabulary continuous speech recognition. Although discriminative training has become a major line of rese...
Simon Wiesler, Georg Heigold, Markus Nußbaum...
INTERSPEECH
2010
12 years 11 months ago
Hidden Markov models with context-sensitive observations for grapheme-to-phoneme conversion
Hidden Markov models (HMMs) have proven useful in various aspects of speech technology from automatic speech recognition through speech synthesis, speech segmentation and grapheme...
Udochukwu Kalu Ogbureke, Peter Cahill, Julie Carso...
INTERSPEECH
2010
12 years 11 months ago
Autoregressive clustering for HMM speech synthesis
The autoregressive HMM has been shown to provide efficient parameter estimation and high-quality synthesis, but in previous experiments decision trees derived from a non-autoregre...
Matt Shannon, William Byrne
INTERSPEECH
2010
12 years 11 months ago
Constructing Japanese test collections for spoken term detection
Spoken Document Retrieval (SDR) and Spoken Term Detection (STD) have been two of the most intensively investigated topics in spoken document processing research according to the e...
Yoshiaki Itoh, Hiromitsu Nishizaki, Xinhui Hu, Hir...
INTERSPEECH
2010
12 years 11 months ago
Multichannel noise reduction using low order RTF estimate
The relative transfer function generalized sidelobe canceler (RTF-GSC) is a popular method for implementing multichannel speech enhancement. However, an accurate estimation of cha...
Subhojit Chakladar, Nam Soo Kim, Yu Gwang Jin, Tae...