Acoustic data sharing for Afghan and Persian languages

14 years 8 months ago

Download mirlab.org

In this work, we compare several known approaches for multilingual acoustic modeling for three languages, Dari, Farsi and Pashto, which are of recent geo-political interest. We demonstrate that we can train a single multilingual acoustic model for these languages and achieve recognition accuracy close to that of monolingual (or language-dependent) models. When only a small amount of training data is available for each of these languages, the multilingual model may even outperform the monolingual ones. We also explore adapting the multilingual model to target language data, which are able to achieve improved automatic speech recognition (ASR) performance compared to the monolingual models for both large and small amounts of training data by 3% relative word error rate (WER).

Arindam Mandal, Dimitra Vergyri, Murat Akbacak, Co

Real-time Traffic

ICASSP 2011 | Multilingual Acoustic Model | Multilingual Acoustic Modeling | Multilingual Model | Signal Processing |

claim paper

» Evaluation of several Maximum Likelihood Linear Regression Variants for Language Adaptatio...

» Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture M...

Post Info
More Details (n/a)

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Arindam Mandal, Dimitra Vergyri, Murat Akbacak, Colleen Richey, Andreas Kathol

Comments (0)

Sciweavers

Acoustic data sharing for Afghan and Persian languages

ICASSP 2011 | Multilingual Acoustic Model | Multilingual Acoustic Modeling | Multilingual Model | Signal Processing |

Explore & Download

Productivity Tools

Sciweavers