This paper proposes a method to improve the quality delivered by statistical parametric speech synthesizers. For this, we use a codebook of pitch-synchronous residual frames, so a...
Thomas Drugman, Alexis Moinet, Thierry Dutoit, Geo...
Active learning has been proven a reliable strategy to reduce manual efforts in training data labeling. Such strategies incorporate the user as oracle: the classifier selects the m...
Unsupervised learning algorithms aim to discover the structure hidden in the data, and to learn representations that are more suitable as input to a supervised machine than the ra...
Multi-class classification schemes typically require human input in the form of precise category names or numbers for each example to be annotated – providing this can be impra...
Ajay Joshi, Fatih Porikli, Nikolaos Papanikolopoul...
In this paper, we propose an approach for fast pedestrian detection in images. Inspired by the histogram of oriented gradient (HOG) features, a set of multi-scale orientation (MSO...