Sciweavers

Share
TASLP
2016

Packet Loss Concealment Based on Deep Neural Networks for Digital Speech Transmission

4 years 2 months ago
Packet Loss Concealment Based on Deep Neural Networks for Digital Speech Transmission
—In this paper, we propose the regression-based packet loss concealment (PLC) for digital speech transmission by using deep neural networks (DNNs) with a multiple-layer deep architecture. For the DNN training, log-power spectra and phases are employed as features in the input layer for the large training set, which ensures non-linear mapping the frames from the last correctly received frame to the missing frame. Once the training is accomplished by the restricted Boltzmann machine (RBM)-based pre-training to initialize the DNN, minimum mean square error (MMSE)-based fine tuning is then performed based on the back-propagation algorithm. In the reconstruction stage, the trained DNN model is fed with the features of the previous frames in order to estimate the log-power spectra and phases of the missing frames. Reconstruction is further improved by using the cross-fading technique to mitigate discontinuity between the reconstruction signal and good frame signal in the time-domain. To d...
Bong-Ki Lee, Joon-Hyuk Chang
Added 10 Apr 2016
Updated 10 Apr 2016
Type Journal
Year 2016
Where TASLP
Authors Bong-Ki Lee, Joon-Hyuk Chang
Comments (0)
books