NLP on Spoken Documents Without ASR

There is considerable interest in interdisciplinary combinations of automatic speech recognition (ASR), machine learning, natural language processing, text classification and information retrieval. Many of these boxes, especially ASR, are often based on considerable linguistic resources. We would like to be able to process spoken documents with few (if any) resources. Moreover, connecting black boxes in series tends to multiply errors, especially when the key terms are out-of-vocabulary (OOV). The proposed alternative applies text processing directly to the speech without a dependency on ASR. The method finds long (~1 sec) repetitions in speech and clusters them into pseudo-terms (roughly phrases). Document clustering and classification work surprisingly well on pseudo-terms; performance on a Switchboard task approaches a baseline using gold-standard manual transcriptions.
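
The pipeline described in the abstract (repeated segments, then pseudo-terms, then document-level features) can be illustrated with a minimal sketch. The abstract does not say how repetitions are detected or clustered, so everything here is an assumption for illustration: the acoustic matching step is taken as already done and delivered as pairs of matching segments, clustering is done by simple connected components, and the names `cluster_matches` and `bag_of_pseudo_terms` are hypothetical.

```python
from collections import Counter, defaultdict

# A segment is (doc_id, start_sec, end_sec). A "match" is a pair of segments
# that an upstream acoustic matcher (not shown, assumed) judged to be
# repetitions of each other.

def cluster_matches(matches):
    """Assumption: group matched segments into pseudo-terms by taking
    connected components of the match graph (union-find)."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in matches:
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[rb] = ra

    # Give each component a compact integer pseudo-term id.
    roots = {}
    return {seg: roots.setdefault(find(seg), len(roots)) for seg in list(parent)}

def bag_of_pseudo_terms(segment_to_term):
    """Count pseudo-term occurrences per document, analogous to bag-of-words."""
    bags = defaultdict(Counter)
    for (doc_id, _start, _end), term in segment_to_term.items():
        bags[doc_id][term] += 1
    return dict(bags)

# Toy input: three documents, two repeated "phrases".
matches = [
    (("d1", 0.0, 1.1), ("d2", 3.0, 4.0)),
    (("d2", 3.0, 4.0), ("d3", 5.2, 6.3)),
    (("d1", 7.0, 8.2), ("d3", 9.0, 10.1)),
]
segment_to_term = cluster_matches(matches)
bags = bag_of_pseudo_terms(segment_to_term)
```

The resulting per-document counts can be fed to any standard text clustering or classification method in place of word counts, which is the sense in which text processing is applied "directly to the speech".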

Mark Dredze, Aren Jansen, Glen Coppersmith, Ken Ward Church

Considerable Linguistic Resources | EMNLP 2010 | Natural Language Processing | Standard Manual Transcriptions

Added	11 Feb 2011
Updated	11 Feb 2011
Type	Conference
Year	2010
Where	EMNLP
Authors	Mark Dredze, Aren Jansen, Glen Coppersmith, Ken Ward Church
