A multi-modal approach for determining speaker location and focus

13 years 9 months ago

Download groups.csail.mit.edu

This paper presents a multi-modal approach to locate a speaker in a scene and determine to whom he or she is speaking. We present a simple probabilistic framework that combines multiple cues derived from both audio and video information. A purely visual cue is obtained using a head tracker to identify possible speakers in a scene and provide both their 3-D positions and orientation. In addition, estimates of the audio signal’s direction of arrival are obtained with the help of a two-element microphone array. A third cue measures the association between the audio and the tracked regions in the video. Integrating these cues provides a more robust solution than using any single cue alone. The usefulness of our approach is shown in our results for video sequences with two or more people in a prototype interactive kiosk environment.

Michael Siracusa, Louis-Philippe Morency, Kevin Wi

Real-time Traffic

ICMI 2003 | Multi-modal Approach | Simple Probabilistic Framework | Two-element Microphone Array |

claim paper

» Multimodal Reference to Objects An Empirical Approach

» MultiImage Focus of Attention for Rapid Site Model Construction

» Wrapping snakes for improved lip segmentation

» Position calibration of audio sensors and actuators in a distributed computing platform

» Automatic visualonly language identification A preliminary study

» Methods for Achieving Fast Query Times in Point Location Data Structures

» Generating Confusion Sets for ContextSensitive Error Correction

» Semantic Methods for P2P Query Routing

Post Info
More Details (n/a)

Added	07 Jul 2010
Updated	07 Jul 2010
Type	Conference
Year	2003
Where	ICMI
Authors	Michael Siracusa, Louis-Philippe Morency, Kevin Wilson, John W. Fisher III, Trevor Darrell

Comments (0)

Sciweavers

A multi-modal approach for determining speaker location and focus

ICMI 2003 | Multi-modal Approach | Simple Probabilistic Framework | Two-element Microphone Array |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers