Sampling multisensory information and taking the appropriate motor action is critical for a biological organism’s survival, but a difficult task for robots. We present a Neurally...
We propose a joint representation and classification framework that achieves the dual goal of finding the most discriminative sparse overcomplete encoding and optimal classifier p...
Abstract. In numerous content-based video applications, it is important to extract from a video sequence a representation for humans in motion. This task is di cult, because humans...
Aesthetic features such as animation, 3D interaction, and visual metaphors are becoming commonplace in multimedia search interfaces. However, it is unclear which attributes are ne...
Detecting different categories of objects in image and video content is one of the fundamental tasks in computer vision research. The success of many applications such as visual s...