Multi-camera networks bring in potentials for a variety of vision-based applications through provisioning of rich visual information. In this paper a method of image segmentation f...
Computed tomography (CT) images have been widely used for diagnosis of liver disease and volume measurement for liver surgery or transplantation. Automatic liver segmentation and ...
In this paper, we propose a generative model-based approach for audio-visual event classification. This approach is based on a new unsupervised learning method using an extended p...
Ming Li, Sanqing Hu, Shih-Hsi Liu, Sung Baang, Yu ...
We propose a new two-stage framework for joint analysis of head gesture and speech prosody patterns of a speaker toward automatic realistic synthesis of head gestures from speech p...
We describe how to model the appearance of a 3-D object using multiple views, learn such a model from training images, and use the model for object recognition. The model uses pro...