This paper presents a novel audio-visual fusion method for speech detection, which is an important front-end for content-based video processing. This approach aims to extract homo...
Cong Li, Zhijian Ou, Wei Hu, Tao Wang, Yimin Zhang
This paper addresses the problem of developing appropriate features for use in direct modeling approaches to speech recognition, such as those based on Maximum Entropy models or S...
Video contains multiple types of audio and visual information, which are difficult to extract, combine, or trade off in general video information retrieval. This paper provides an ...
In this paper, we propose two approaches for combining geometric information with the ICA algorithm to solve the permutation problem in the scenario where rough information about the...
This paper focuses on a grammar-based approach to semantic interpretation, which combines the notions of robust and weighted parsing. In restricted domains of application in infor...