In implicit human computer interaction, computers are required to understand users’ actions and intentions so as to provide proactive services. Visual processing has to detect an...
In this paper, we first discuss the meaning of physical embodiment and the complexity of the environment in the context of multi-agent learning. We then propose a vision-based rei...
This paper addresses the problem of automatic temporal
annotation of realistic human actions in video using mini-
mal manual supervision. To this end we consider two asso-
ciate...
Olivier Duchenne, Ivan Laptev, Josef Sivic, Franci...
We present a compact representation for human action recognition in videos using line and optical flow histograms. We introduce a new shape descriptor based on the distribution of...
Nazli Ikizler, Pinar Duygulu, Ramazan Gokberk Cinb...
The development of user interfaces based on vision and speech requires the solution of a challenging statistical inference problem: The intentions and actions of multiple individu...