—We investigate architectures for time encoding and time decoding of visual stimuli such as natural and synthetic video streams (movies, animation). The architecture for time enc...
Depth reconstruction or acquisition with a 3-D camera results in a video sequence where each pixel of a frame is annotated with a depth value. We propose an approach to combine th...
Fabian Ernst, Cornelius W. A. M. van Overveld, Pio...
Tracking body poses of multiple persons in monocular video is a challenging problem due to the high dimensionality of the state space and issues such as inter-occlusion of the pers...
This paper presents a methodology for analyzing multimodal and multiperspective systems for person surveillance. Using an experimental testbed consisting of two color and two infra...
We propose a new approach for combining acoustic and visual measurements to aid in recognizing lip shapes of a person speaking. Our method relies on computing the maximum likeliho...