We propose a computational model motivated by human cognitive processes for detecting changes of driving environments. The model, call dynamic visual model, consists of three majo...
Abstract. In this paper we investigate a new method of learning partbased models for visual object recognition, from training data that only provides information about class member...
Visual information has been shown to improve the performance of speech recognition systems in noisy acoustic environments. However, most audio-visual speech recognizers rely on a ...
We integrate automatic speech recognition (ASR) and question answering (QA) to realize a speech-driven QA system, and evaluate its performance. We adapt an Ngram language model to...
Multi-stream hidden Markov models (HMMs) have recently been very successful in audio-visual speech recognition, where the audio and visual streams are fused at the final decision...