We investigate the issue of sign language automatic phonetic subunit modeling, that is completely data driven and without any prior phonetic information. A first step of visual p...
This contribution addresses the problem of obtaining photorealistic 3D models of a scene from images alone with a structure-from-motion approach. The 3D scene is observed from mul...
We consider the problem of unsupervised classification of temporal sequences of facial expressions in video. This problem arises in the design of an adaptive visual agent, which m...
Multi-agent interactions often result in mutual occlusion sequences which constitute a visual signature for the event. We define six qualitative occlusion primitives based on the ...
Amitabha Mukerjee, K. S. Venkatesh, Pabitra Mitra,...
Two temporally scalable video coding techniques, temporal subband coding (TSB) and predictive coding, are evaluated both theoretically and in practice to provide comparisons of co...