Computer vision systems for human-computer interaction have tended towards more precise forms of interface that require complex vision tasks such as segmentation, tracking, object...
This paper investigates the semantic analysis of broadcast tennis footage. We consider the spatio-temporal behaviour of an object in the footage as being the embodiment of a seman...
When recognizing multiple fonts, geometric features, such as the directional information of strokes, are generally robust against deformation but are weak against degradation. Thi...
In this paper, an approach to detection of caption text in video frames is described. Text recognition in video can be applied to various applications, however there are still pro...
Abstract. The paper describes a three-layer video coder based on spatiotemporal scalability and data partitioning. The base layer represents video sequences with reduced spatial an...