Graceful degradation of speech recognition performance over packet-erasure networks

13 years 9 months ago

Download dcl.ee.washington.edu

Abstract--This paper explores packet loss recovery for automatic speech recognition (ASR) in spoken dialog systems, assuming an architecture in which a lightweight client communicates with a remote ASR server. Speech is transmitted with source and channel codes optimized for the ASR application, i.e., to minimize word error rate. Unequal amounts of forward error correction, depending on the data's effect on ASR performance, are assigned to protect against packet loss. Experiments with simulated packet loss in a range of loss conditions are conducted on the DARPA Communicator (air travel information) task. Results show that the approach provides robust ASR performance which degrades gracefully as packet loss rates increase. Transmitting at 5.2 Kbps with up to 200 ms added delay, leads to only a 7% relative degradation in word error rate even under extremely adverse network conditions.

Constantinos Boulis, Mari Ostendorf, Eve A. Riskin

Real-time Traffic

ASR Performance | Packet Loss | TASLP 2002 | Word Error Rate |

claim paper

Added	23 Dec 2010
Updated	23 Dec 2010
Type	Journal
Year	2002
Where	TASLP
Authors	Constantinos Boulis, Mari Ostendorf, Eve A. Riskin, Scott Otterson

Sciweavers

Graceful degradation of speech recognition performance over packet-erasure networks

ASR Performance | Packet Loss | TASLP 2002 | Word Error Rate |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers