We previously proposed a decoding method for automatic speech recognition that uses hypothesis scores weighted by voice activity detection (VAD) measures. This method uses two Gau...
Score functions induced by generative models extract fixed-dimension feature vectors from different-length data observations by subsuming the process of data generation, ...
Alessandro Perina, Marco Cristani, Umberto Castell...
The following article presents a novel, adaptive initialization scheme that can be applied to most state-of-the-art Speaker Diarization algorithms, i.e. algorithms that use agglom...
Ordinal regression has become an effective way of learning user preferences, but most research focuses only on a single regression problem. In this paper we introduce collaborati...
Shipeng Yu, Kai Yu, Volker Tresp, Hans-Peter Krieg...
Acoustic anger detection in voice portals can help to enhance human-computer interaction. A comprehensive voice portal data collection has been carried out and gives new insight o...
Felix Burkhardt, Tim Polzehl, Joachim Stegmann, Fl...