Blind audiovisual separation based on redundant representations

14 years 3 months ago

Download infoscience.epfl.ch

In this work we present a method to perform a complete audiovisual source separation without need of previous information. This method is based on the assumption that sounds are caused by moving structures. Thus, an efﬁcient representation of audio and video sequences allows to build relationships between synchronous structures on both modalities. A robust clustering algorithm groups video structures exhibiting strong correlations with the audio so that sources are counted and located in the image. Using such information and exploiting audio-video correlation, the audio sources activity is determined. Next, spectral Gaussian Mixture Models (GMMs) are learnt in time slots with only one source active so that it is possible to separate them in case of an audio mixture. Audio source separation performances are rigorously evaluated, clearly showing that the proposed algorithm performs efﬁciently and robustly.

Anna Llagostera Casanovas, Gianluca Monaci, Pierre

Real-time Traffic

Audio Source | Audiovisual Source Separation | ICASSP 2008 | Signal Processing | Source Separation |

claim paper

» Encoding large array signals into a 3D sound field representation for selective listening ...

» Morphological Diversity and Sparsity in Blind Source Separation

» A null space method for overcomplete blind source separation

» Bayesian blind source separation for brain imaging

» A Mathematical Formalism for the Evaluation of CSpace for Redundant Robots

» Informationdriven interactionoriented programming BSPL the blindingly simple protocol lang...

» Sparsity And Morphological Diversity For Hyperspectral Data Analysis

» Degenerate Unmixing Estimation Technique using the Constant Q Transform

Post Info
More Details (n/a)

Added	30 May 2010
Updated	30 May 2010
Type	Conference
Year	2008
Where	ICASSP
Authors	Anna Llagostera Casanovas, Gianluca Monaci, Pierre Vandergheynst, Rémi Gribonval

Comments (0)

Sciweavers

Blind audiovisual separation based on redundant representations

Audio Source | Audiovisual Source Separation | ICASSP 2008 | Signal Processing | Source Separation |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers