Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

11

ICASSP
2009
IEEE

favoriteEmaildiscussreport

155views Signal Processing» more ICASSP 2009»

Unsupervised speaker adaptation for telephone call transcription

13 years 2 months ago

Unsupervised speaker adaptation for telephone call transcription

Download research.microsoft.com

The use of the PC and Internet for placing telephone calls will present new opportunities to capture vast amounts of un-transcribed speech for a particular speaker. This paper investigates how to best exploit this data for speaker-dependent speech recognition. Supervised and unsupervised experiments in acoustic model and language model adaptation are presented. Using one hour of automatically transcribed speech per speaker with a word error rate of 36.0%, unsupervised adaptation resulted in an absolute gain of 6.3%, equivalent to 70% of the gain from the supervised case, with additional adaptation data likely to yield further improvements. LM adaptation experiments suggested that although there seems to be a small degree of speaker idiolect, adaptation to the speaker alone, without considering the topic of the conversation, is in itself unlikely to improve transcription accuracy.

R. Wallace, Kishan Thambiratnam, Frank Seide

Real-time Traffic

Additional Adaptation Data | ICASSP 2009 | Language Model Adaptation | Signal Processing | Speaker-dependent Speech Recognition |

claim paper

Related Content

» Unsupervised intraspeaker variability compensation based on Gestalt and model adaptation i...

» CallSurf Automatic Transcription Indexing and Structuration of Call Center Conversational ...

» MACROPHONE An American English Telephone Speech Corpus

» A Simple But Effective Approach to Speaker Tracking in Broadcast News

» Constrained discriminative mapping transforms for unsupervised speaker adaptation

» Vocabulary and language model adaptation using just one speech file

» On speaker adaptive training of artificial neural networks

» Multilingual Speech Databases at LDC

» A study of an irrelevant variability normalization based discriminative training approach ...

Post Info
More Details (n/a)

Added	18 Feb 2011
Updated	18 Feb 2011
Type	Journal
Year	2009
Where	ICASSP
Authors	R. Wallace, Kishan Thambiratnam, Frank Seide

Comments (0)