Abstract--This paper presents a model for machine aided human translation (MAHT) that integrates source language text and target language acoustic information to produce the text t...
A method is described for estimating the fundamental frequencies of several concurrent sounds in polyphonic music and multiple-speaker speech signals. The method consists of a comp...
The increasing availability of various types of interactive platforms has raised the need for design environments able to support development of user interfaces that exploit intera...
This paper addresses the 3D tracking of pose and animation of the human face in monocular image sequences using deformable 3D models. For each frame, the proposed adaptation is sp...
We present an active learning framework to simultaneously learn appearance and contextual models for scene understanding tasks (multi-class classification). Existing multi-class a...