In order to improve efficiency of video coding, temporal redundancy between neighboring frames can be reduced. In MPEG-2, some frames, named interframes, are predicted using a mot...
In this paper we describe integrated multimedia processing for Video Scout, a system that segments and indexes TV programs according to their audio, visual, and transcript informa...
Radu S. Jasinschi, Nevenka Dimitrova, Thomas McGee...
In this paper, we propose a word shape recognition method for retrieving image-based documents. Document images are segmented at the word level first. Then the proposed method det...
In this paper, we propose a new method for face detection from cluttered images. We use a polynomial neural network (PNN) for separation of face and non-face patterns while the co...
3D Human face models have been widely used in applications such as face recognition, facial expression recognition, human action recognition, head tracking, facial animation, vide...
Simple wavelet and wavelet packet transforms have often been used for texture characterisation through the analysis of spatial-frequency content. However, most previous methods ma...
Paul R. Hill, David R. Bull, Cedric Nishan Canagar...
This paper covers a method for capturing documents using a digital camera. A typical cheap VGA digital camera (resolution 640 by 480 pixels) does not have adequate resolution to c...