It is now accepted that the most effective video shot retrieval is based on indexing and retrieving clips using multiple, parallel modalities such as text-matching, image-matching a...
The Scale Invariant Feature Transform (SIFT) has become a popular feature extractor for vision-based applications. It has been successfully applied to metric localization...
The task of registering video frames with a static model is a common problem in many computer vision domains. The standard approach to registration involves finding point correspo...
In recent years, the language model Latent Dirichlet Allocation (LDA), which clusters co-occurring words into topics, has been widely applied in the computer vision field. Howeve...
We present a novel approach to estimating depth from single omnidirectional camera images by learning the relationship between visual features and range measurements available dur...