Due to the ‘semantic gap’ between low-level visual features and the rich semantics in user’s mind, performance of traditional contentbased image retrieval systems is far from...
Automatic lipreading is automatic speech recognition that uses only visual information. The relevant data in a video signal is isolated and features are extracted from it. From a s...
High-dimensional index is one of the most challenging tasks for content-based video retrieval (CBVR). Typically, in video database, there exist two kinds of clues for query: visual...
Zhiping Shi, Qingyong Li, Zhiwei Shi, Zhongzhi Shi
Data Warehouse (DW), On-Line Analytical Processing (OLAP) and Geographical Information System (GIS) are tools for providing decision-making support. Much research is aimed at inte...
In this paper we present the methods and visualizations used in the MediaMill video search engine. The basis for the engine is a semantic indexing process which derives a lexicon ...
Cees Snoek, Dennis Koelma, Giang P. Nguyen, Marcel...