Sciweavers

TASLP
2016
10 years 16 days ago
Robust Quad-Based Audio Fingerprinting
We propose an audio fingerprinting method that adapts findings from the field of blind astrometry to define simple, efficiently representable characteristic feature combina...
Reinhard Sonnleitner, Gerhard Widmer
TASLP
2016
10 years 16 days ago
Bayesian Analysis of Phoneme Confusion Matrices
This paper presents a parametric Bayesian approach to the statistical analysis of phoneme confusion matrices measured for groups of individual listeners in one or more t...
Arne Leijon, Gustav Eje Henter, Martin Dahlquist
TASLP
2016
10 years 16 days ago
A Fast Method for High-Resolution Voiced/Unvoiced Detection and Glottal Closure/Opening Instant Estimation of Speech
We propose a fast speech analysis method which simultaneously performs high-resolution voiced/unvoiced detection (VUD) and accurate estimation of glottal closure and glottal ope...
Andreas I. Koutrouvelis, George P. Kafentzis, Niko...
TASLP
2016
10 years 16 days ago
Complex Ratio Masking for Monaural Speech Separation
Speech separation systems usually operate on the short-time Fourier transform (STFT) of noisy speech, and enhance only the magnitude spectrum while leaving the phase spectrum un...
Donald S. Williamson, Yuxuan Wang, DeLiang Wang
TASLP
2016
10 years 16 days ago
Unseen Noise Estimation Using Separable Deep Auto Encoder for Speech Enhancement
Unseen noise estimation is a key yet challenging step to make a speech enhancement algorithm work in adverse environments. At worst, the only prior knowledge we know about the e...
Meng Sun, Xiongwei Zhang, Hugo Van hamme, Thomas F...