Sciweavers

ICASSP
2011
IEEE

Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation

13 years 2 months ago
Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation
Separating multiple tracks from professionally produced music recordings (PPMRs) is still a challenging problem. We address this task with a user-guided approach in which the separation system is provided segmental information indicating the time activations of the particular instruments to separate. This information may typically be retrieved from manual annotation. We use a so-called multichannel nonnegative tensor factorization (NTF) model, in which the original sources are observed through a multichannel convolutive mixture and in which the source power spectrograms are jointly modeled by a 3-valence (time/frequency/source) tensor. Our user-guided separation method produced competitive results at the 2010 Signal Separation Evaluation Campaign, with sufficient quality for real-world music editing applications.
Alexey Ozerov, Cédric Févotte, Rapha
Added 21 Aug 2011
Updated 21 Aug 2011
Type Journal
Year 2011
Where ICASSP
Authors Alexey Ozerov, Cédric Févotte, Raphaël Blouet, Jean-Louis Durrieu
Comments (0)