Indexes for large collections are often divided into shards that are distributed across multiple computers and searched in parallel to provide rapid interactive search. Typically,...
This paper presents an efficient indexing and retrieval scheme for searching in document image databases. In many non-European languages, optical character recognizers are not very...
Although the availability of large video corpora is on the rise, the value of these datasets remains largely untapped due to the difficulty of analyzing their contents. Automatic ...
In this paper we address the problem of unsupervised Web data extraction. We show that unsupervised Web data extraction becomes feasible when supposing pages that are made up of r...
Associative classification (AC) has been studied in the areas of content-based multimedia retrieval and semantic concept detection due to its high accuracy. The traditional AC alg...
Lin Lin, Mei-Ling Shyu, Guy Ravitz, Shu-Ching Chen