Sciweavers

INTERSPEECH
2010
12 years 11 months ago
HMM-based text-to-articulatory-movement prediction and analysis of critical articulators
In this paper we present a method to predict the movement of a speaker's mouth from text input using hidden Markov models (HMM). We have used a corpus of human articulatory m...
Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi
INTERSPEECH
2010
12 years 11 months ago
Hierarchical multilayer perceptron based language identification
Automatic language identification (LID) systems generally exploit acoustic knowledge, possibly enriched by explicit language specific phonotactic or lexical constraints. This pape...
David Imseng, Mathew Magimai-Doss, Hervé Bo...
INTERSPEECH
2010
12 years 11 months ago
A novel hybrid approach for Mandarin speech synthesis
The paper investigates a new method to solve concatenation problems of Mandarin speech synthesis which is based on the hybrid approach of HMM-based speech synthesis and unit selec...
Shifeng Pan, Meng Zhang, Jianhua Tao
INTERSPEECH
2010
12 years 11 months ago
Topic and style-adapted language modeling for Thai broadcast news ASR
The amount of available Thai broadcast news transcribed text for training a language model is still very limited compared to other major languages. Since the construction of a b...
Markpong Jongtaveesataporn, Sadaoki Furui
INTERSPEECH
2010
12 years 11 months ago
Rapid development of speech translation using consecutive interpretation
The development of a speech translation (ST) system is costly, largely because it is expensive to collect parallel data. A new language pair is typically only considered in the af...
Matthias Paulik, Alex Waibel
INTERSPEECH
2010
12 years 11 months ago
Adaptation of a tongue shape model by local feature transformations
Reconstructing the full contour of the tongue from the position of 3 to 4 landmarks on it is useful in articulatory speech work. This can be done with submillimetric accuracy usin...
Chao Qin, Miguel Á. Carreira-Perpiñ...
INTERSPEECH
2010
12 years 11 months ago
Modeling liaison in French by using decision trees
French is known to be a language with major pronunciation irregularities at word endings with consonants. Particularly, the well-known phonetic phenomenon called Liaison is one of...
Josafá de Jesus Aguiar Pontes, Sadaoki Furu...
INTERSPEECH
2010
12 years 11 months ago
A classifier-based target cost for unit selection speech synthesis trained on perceptual data
Our goal is to automatically learn a perceptually-optimal target cost function for a unit selection speech synthesiser. The approach we take here is to train a classifier on human...
Volker Strom, Simon King
INTERSPEECH
2010
12 years 11 months ago
Robust and efficient pitch estimation using an iterative ARMA technique
In this article, we propose an innovative way of estimating pitch from speech waveform data, using an iterative ARMA technique that efficiently estimates multiple frequency compon...
Jung Ook Hong, Patrick J. Wolfe
INTERSPEECH
2010
12 years 11 months ago
Artificial and online acquired noise dictionaries for noise robust ASR
Recent research has shown that speech can be sparsely represented using a dictionary of speech segments spanning multiple frames, exemplars, and that such a sparse representation ...
Jort F. Gemmeke, Tuomas Virtanen