This paper presents a bottom-up approach that combines audio and video to simultaneously locate individual speakers in the video (2-D source localization) and segment their speech ...
—In this work we propose a dynamic-texture-based approach to the recognition of facial Action Units (AUs, atomic facial gestures) and their temporal models (i.e., sequences of te...
Recent work on language models for information retrieval has shown that smoothing language models is crucial for achieving good retrieval performance. Many different effective smo...
There has been a large amount of research on efficient document retrieval in both IR and web search areas. One important technique to improve retrieval efficiency is early termina...
This paper describes a bit allocation algorithm that can achieve a constant bit rate when coding multiple video objects (MVO's), while improving the rate-distortion (R-D) per...
Jeong-Woo Lee, Anthony Vetro, Yao Wang, Yo-Sung Ho