Abstract--This paper presents a novel method for automatically classifying consumer video clips based on their soundtracks. We use a set of 25 overlapping semantic classes, chosen ...
The recognition of transitive, goal-directed actions requires a sensible balance between the representation of specific shape details of effector and goal object and robustness w...
This paper reports on a UK ESRC-funded project studying representations of practice, in video clips and voice annotations, for professional collaborative learning in distributed o...
In this work we present a method to perform a complete audiovisual source separation without need of previous information. This method is based on the assumption that sounds are c...
Anna Llagostera Casanovas, Gianluca Monaci, Pierre...
In this paper, we propose a new manifold representation capable of being applied for visual speech recognition. In this regard, the real time input video data is compressed using P...
Dahai Yu, Ovidiu Ghita, Alistair Sutherland, Paul ...