Cheap, Fast and Good Enough: Automatic Speech Recognition with Non-Expert Transcription

Deploying an automatic speech recognition system with reasonable performance requires expensive and time-consuming in-domain transcription. Previous work demonstrated that non-professional annotation through Amazon's Mechanical Turk can match professional quality. We use Mechanical Turk to transcribe conversational speech for as little as one-thirtieth the cost of professional transcription. The higher disagreement of non-professional transcribers does not have a significant effect on system performance. While previous work demonstrated that redundant transcription can improve data quality, we found that resources are better spent collecting more data. Finally, we describe a quality control method that does not require professional transcription.
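
The abstract refers to disagreement between transcribers without saying how it is measured. One common way to quantify it, assumed here for illustration only, is the word-level edit distance between two transcripts of the same utterance, normalized by the length of the transcript treated as the reference (word error rate). The function name and the lowercase, whitespace-split normalization below are hypothetical choices, not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: transcripts are compared after lowercasing and
    splitting on whitespace; real scoring tools normalize more carefully.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript is empty")

    # prev[j] is the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Two hypothetical transcripts of the same utterance.
    a = "yeah i think so"
    b = "yeah i think so too"
    print(word_error_rate(a, b))
```

Because the measure is normalized by the reference length, it is not symmetric: swapping the two transcripts can change the value, so a pairwise disagreement figure should state which transcript served as the reference.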

Scott Novotney, Chris Callison-Burch

Automatic Speech Recognition | Computational Linguistics | Mechanical Turk | NAACL 2010 | Professional Transcription

Post Info

Added	14 Feb 2011
Updated	14 Feb 2011
Type	Conference
Year	2010
Where	NAACL
Authors	Scott Novotney, Chris Callison-Burch
