Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation

14 years 11 months ago

Download perso.telecom-paristech.fr

Separating multiple tracks from professionally produced music recordings (PPMRs) is still a challenging problem. We address this task with a user-guided approach in which the separation system is provided segmental information indicating the time activations of the particular instruments to separate. This information may typically be retrieved from manual annotation. We use a so-called multichannel nonnegative tensor factorization (NTF) model, in which the original sources are observed through a multichannel convolutive mixture and in which the source power spectrograms are jointly modeled by a 3-valence (time/frequency/source) tensor. Our user-guided separation method produced competitive results at the 2010 Signal Separation Evaluation Campaign, with sufﬁcient quality for real-world music editing applications.

Alexey Ozerov, Cédric Févotte, Rapha

Real-time Traffic

ICASSP 2011 | Multichannel Convolutive Mixture | Multichannel Nonnegative Tensor | Signal Processing | Signal Separation Evaluation |

claim paper

Post Info
More Details (n/a)

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Alexey Ozerov, Cédric Févotte, Raphaël Blouet, Jean-Louis Durrieu

Comments (0)

Sciweavers

Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation

ICASSP 2011 | Multichannel Convolutive Mixture | Multichannel Nonnegative Tensor | Signal Processing | Signal Separation Evaluation |

Explore & Download

Productivity Tools

Sciweavers