Sciweavers

4 search results - page 1 / 1
» A robust front page detection algorithm for large periodical...
Sort
View
ICPR
2008
IEEE
13 years 11 months ago
A robust front page detection algorithm for large periodical collections
Large-scale digitization projects aimed at periodicals often have as input streams of completely unlabeled document images. In such situations, the results produced by the automat...
Iuliu Vasile Konya, Christoph Seibert, Sebastian G...
SIGIR
2008
ACM
13 years 4 months ago
SpotSigs: robust and efficient near duplicate detection in large web collections
Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...
Martin Theobald, Jonathan Siddharth, Andreas Paepc...
CVPR
2003
IEEE
14 years 6 months ago
Document Image Enhancement Using Directional Wavelet
This paper proposes a novel algorithm to clean up a large collection of historical handwritten documents kept in the National Archives of Singapore. Due to the seepage of ink over...
Qian Wang, Tao Xia, Lida Li, Chew Lim Tan
JCDL
2009
ACM
162views Education» more  JCDL 2009»
13 years 11 months ago
Supporting analysis of future-related information in news archives and the web
A lot of future-related information is available in news articles or Web pages. This information can however differ to large extent and may fluctuate over time. It is therefore di...
Adam Jatowt, Kensuke Kanazawa, Satoshi Oyama, Kats...