Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

28

ECCV
2002
Springer

favoriteEmaildiscussreport

133views Computer Vision» more ECCV 2002»

Probabalistic Models and Informative Subspaces for Audiovisual Correspondence

14 years 11 months ago

Probabalistic Models and Informative Subspaces for Audiovisual Correspondence

Download groups.csail.mit.edu

Abstract. We propose a probabalistic model of single source multimodal generation and show how algorithms for maximizing mutual information can find the correspondences between components of each signal. We show how non-parametric techniques for finding informative subspaces can capture the complex statistical relationship between signals in different modalities. We extend a previous technique for finding informative subspaces to include new priors on the projection weights, yielding more robust results. Applied to human speakers, our model can find the relationship between audio speech and video of facial motion, and partially segment out background events in both channels. We present new results on the problem of audio-visual verification, and show how the audio and video of a speaker can be matched even when no prior model of the speaker's voice or appearance is available.

John W. Fisher III, Trevor Darrell

Real-time Traffic

Complex Statistical Relationship | Computer Vision | ECCV 2002 | Informative Subspaces | Prior Model | Probabalistic Model | Source Multimodal Generation |

claim paper

Related Content

» Learning Joint Statistical Models for AudioVisual Fusion and Segregation

» MultiFrame Optical Flow Estimation using Subspace Constraints

» Toward Intelligent Use of Semantic Information on Subspace Discovery for Image Retrieval

» Facial Deblur Inference Using Subspace Analysis for Recognition of Blurred Faces

» A partial least squares framework for speaker recognition

» What can quantum theory bring to information retrieval

» Subspace Clustering of High Dimensional Data

» Learning image manifolds by semantic subspace projection

» Utilizing affective analysis for efficient movie browsing

Post Info
More Details (n/a)

Added	16 Oct 2009
Updated	16 Oct 2009
Type	Conference
Year	2002
Where	ECCV
Authors	John W. Fisher III, Trevor Darrell

Comments (0)