Sciweavers

TSD
2005
Springer

Why Is the Recognition of Spontaneous Speech so Hard?

13 years 9 months ago
Why Is the Recognition of Spontaneous Speech so Hard?
Although speech, derived from reading texts, and similar types of speech, e.g. that from reading newspapers or that from news broadcast, can be recognized with high accuracy, recognition accuracy drastically decreases for spontaneous speech. This is due to the fact that spontaneous speech and read speech are significantly different acoustically as well as linguistically. This paper reports analysis and recognition of spontaneous speech using a large-scale spontaneous speech database “Corpus of Spontaneous Japanese (CSJ)”. Recognition results in this experiment show that recognition accuracy significantly increases as a function of the size of acoustic as well as language model training data and the improvement levels off at approximately 7M words of training data. This means that acoustic and linguistic variation of spontaneous speech is so large that we need a very large corpus in order to encompass the variations. Spectral analysis using various styles of utterances in the CS...
Sadaoki Furui, Masanobu Nakamura, Tomohisa Ichiba,
Added 28 Jun 2010
Updated 28 Jun 2010
Type Conference
Year 2005
Where TSD
Authors Sadaoki Furui, Masanobu Nakamura, Tomohisa Ichiba, Koji Iwano
Comments (0)