Speech Structure and Its Application to Robust Speech Processing

13 years 3 months ago

Download www.gavo.t.u-tokyo.ac.jp

Speech communication consists of three steps: production, transmission, and hearing. Every step inevitably involves acoustic distortions due to gender diﬀerences, age, microphone- and room-related factors, and so on. In spite of these variations, listeners can extract linguistic information from speech as easily as if the communications had not been aﬀected by variations at all. One may hypothesize that listeners modify their internal acoustic models whenever extralinguistic factors change. Another possibility is that the linguistic information in speech can be represented separately from the extralinguistic factors. In this study, inspired by studies of humans and animals, a novel solution to the problem of intrinsic variations is proposed. Speech structures invariant to these variations are derived as transform-invariant features and their linguistic validity is discussed. Their high robustness is demonstrated by applying the speech structures to automatic speech recognition and ...

Nobuaki Minematsu, Satoshi Asakawa, Masayuki Suzuk

Real-time Traffic

Communications | Extralinguistic Factors | Linguistic Information | NGC 2010 | Speech Structures |

claim paper

» Slovak Speech Database for Experiments and Application Building in UnitSelection Speech Sy...

» A Model for Robust Processing of Spontaneous Speech by Integrating Viable Fragments

» Data Hiding for Speech Bandwidth Extension and its Hardware Implementation

» Robust speech dereverberation based on nonnegativity and sparse nature of speech spectrogr...

» Learning vocal tract variables with multitask kernels

» Combining independent component analysis with geometric information and its application to...

» A Nonlinearized Discriminant Analysis and Its Application to Speech Impediment Therapy

» Robust speech interaction in motorcycle environment

Post Info
More Details (n/a)

Added	29 Jan 2011
Updated	29 Jan 2011
Type	Journal
Year	2010
Where	NGC
Authors	Nobuaki Minematsu, Satoshi Asakawa, Masayuki Suzuki, Yu Qiao

Comments (0)

Sciweavers

Speech Structure and Its Application to Robust Speech Processing

Communications | Extralinguistic Factors | Linguistic Information | NGC 2010 | Speech Structures |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers