Bottom-up cortical representations of visual conspicuity interact with top-down internal cognitive models of the external world to control eye movements, EMs, and the closely linke...
Claudio M. Privitera, Orazio Gallo, Giorgio Grimol...
When users access information from text, they engage in strategic fixation, visually scanning the text to focus on regions of interest. However, because speech is both serial and ...
Bag-of-words (BoW) methods are a popular class of object recognition methods that use image features (e.g., SIFT) to form visual dictionaries and subsequent histogram vectors to r...
In this paper we develop an algorithm for action recognition and localization in videos. The algorithm uses a figurecentric visual word representation. Different from previous ap...
Automatic detection of communication errors in conversational systems has been explored extensively in the speech community. However, most previous studies have used only acoustic...