Sciweavers

ICASSP
2008
IEEE
13 years 11 months ago
Discriminative training by iterative linear programming optimization
In this paper, we cast discriminative training problems into standard linear programming (LP) optimization. Besides being convex and having globally optimal solution(s), LP progra...
Brian Mak, Benny Ng
ICASSP
2008
IEEE
13 years 11 months ago
Quality evaluation of the G.EV-VBR speech codec
ITU-T has selected the candidate submitted by Ericsson, Nokia, Motorola, VoiceAge, and Texas Instruments as the baseline for the G.EV-VBR coding standard. G.EV-VBR is an embedded ...
Anssi Rämö, Henri Toukomaa, S. Craig Gre...
ICASSP
2008
IEEE
13 years 11 months ago
Toward a detector-based universal phone recognizer
In recent research, we have proposed a high-accuracy bottom-up detection-based paradigm for continuous phone speech recognition. The key component of our system was a bank of arti...
Sabato Marco Siniscalchi, Torbjørn Svendsen...
ICASSP
2008
IEEE
13 years 11 months ago
Modified polyphone decision tree specialization for porting multilingual Grapheme based ASR systems to new languages
Automatic speech recognition (ASR) systems have been developed only for a very limited number of the estimated 7,000 languages in the world. In order to avoid the evolvement of a ...
Sebastian Stüker
ICASSP
2008
IEEE
13 years 11 months ago
Fine-grained pitch accent and boundary tone labeling with parametric F0 features
Motivated by linguistic theories of prosodic categoricity, symbolic representations of prosody have recently attracted the attention of speech technologists. Categorical represent...
Sankaranarayanan Ananthakrishnan, Shrikanth Naraya...
ICASSP
2008
IEEE
13 years 11 months ago
Extracting clues from human interpreter speech for spoken language translation
In previous work, we reported dramatic improvements in automatic speech recognition (ASR) and spoken language translation (SLT) gained by applying information extracted from spoke...
Matthias Paulik, Alex Waibel
ICASSP
2008
IEEE
13 years 11 months ago
Theoretical statistical correlation for biometric identification performance
Measurement and evaluation of biometric device performance is critical to end users and consumers of these devices. In this paper we present explicit theoretical correlation model...
Michael E. Schuckers
ICASSP
2008
IEEE
13 years 11 months ago
Text-independent voice conversion based on state mapped codebook
Voice conversion has become more and more important in speech technology, but most of current works have to use parallel utterances of both source and target speaker as the traini...
Meng Zhang, Jianhua Tao, Jilei Tian, Xia Wang
ICASSP
2008
IEEE
13 years 11 months ago
Gaze-contingent asr for spontaneous, conversational speech: An evaluation
There has been little work that attempts to improve the recognition of spontaneous, conversational speech by adding information from a loosely-coupled modality. This study investi...
Neil Cooke, Martin J. Russell