Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

103

Voted

COST
2008
Springer

favoriteEmaildiscussreport

122views Multimedia» more COST 2008»

Articulatory Speech Re-synthesis: Profiting from Natural Acoustic Speech Data

15 years 2 months ago

Articulatory Speech Re-synthesis: Profiting from Natural Acoustic Speech Data

Download www.logopaedie.rwth-aachen.de

The quality of static phones (e.g. vowels, fricatives, nasals, laterals) generated by articulatory speech synthesizers has reached a high level in the last years. Our goal is to expand this high quality to dynamic speech, i.e. whole syllables, words, and utterances by re-synthesizing natural acoustic speech data. Re-synthesis means that vocal tract action units or articulatory gestures, describing the succession of speech movements, are adapted spatio-temporally with respect to a natural speech signal produced by a natural "model speaker" of Standard German. This adaptation is performed using the software tool SAGA (Sound and Articulatory Gesture Alignment) that is currently under development in our lab. The resulting action unit scores are stored in a database and serve as input for our articulatory speech synthesizer. This technique is designed to be the basis for a unit selection articulatory speech synthesis in the future.

Dominik Bauer, Jim Kannampuzha, Bernd J. Krög

Real-time Traffic

Articulatory Gesture | Articulatory Speech | Articulatory Speech Synthesizer | COST 2008 | Multimedia |

claim paper

Related Content

» Correcting Errors in Speech Recognition with Articulatory Dynamics

» The Organization of a Neurocomputational Control Model for Articulatory Speech Synthesis

» Automatic Speech Recognition Based on Electromyographic Biosignals

» An exploratory study of manifolds of emotional speech

» Borrowing Language Resources for Development of Automatic Speech Recognition for Low and M...

» Gesturebased Dynamic Bayesian Network for noise robust speech recognition

» Auditory universal accessibility of data tables using naturally derived prosody specificat...

» Speech Input from Older Users in Smart Environments Challenges and Perspectives

» Reshaping automatic speech transcripts for robust highlevel spoken document analysis

Post Info
More Details (n/a)

Added	18 Oct 2010
Updated	18 Oct 2010
Type	Conference
Year	2008
Where	COST
Authors	Dominik Bauer, Jim Kannampuzha, Bernd J. Kröger

Comments (0)