Sciweavers

TASLP
2010

Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals

Extracting the main melody from a polyphonic music recording seems natural even to untrained human listeners. To a certain extent, it is related to the concept of source separation, given the human ability to focus on a specific source in order to extract relevant information. In this article, we propose a new approach for the estimation and extraction of the main melody (and in particular the leading vocal part) from polyphonic audio signals. To that end, we propose a new signal model in which the leading vocal part is explicitly represented by a specific source/filter model. The proposed representation is investigated in the framework of two statistical models: a Gaussian Scaled Mixture Model (GSMM) and an extended Instantaneous Mixture Model (IMM). For both models, the parameters are estimated within a maximum likelihood framework adapted from single-channel source separation techniques. The desired sequence of fundamental frequencies is then inferred from...
Added 30 Jan 2011
Updated 30 Jan 2011
Type Journal
Year 2010
Where TASLP
Authors Jean-Louis Durrieu, Gaël Richard, Bertrand David, Cédric Févotte