Sciweavers

IJDLS
2010
13 years 2 months ago
Annotating Historical Archives of Images
Recent initiatives like the Million Book Project and Google Print Library Project have already archived several million books in digital format, and within a few years a significa...
Xiaoyue Wang, Lexiang Ye, Eamonn J. Keogh, Christi...
HT
2006
ACM
13 years 10 months ago
Just-in-time recovery of missing web pages
We present Opal, a lightweight framework for interactively locating missing web pages (HTTP status code 404). Opal is an example of “in vivo” preservation: harnessing the col...
Terry L. Harrison, Michael L. Nelson
JCDL
2006
ACM
13 years 10 months ago
Combining DOM tree and geometric layout analysis for online medical journal article segmentation
We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...
Jie Zou, Daniel X. Le, George R. Thoma
ICPR
2008
IEEE
13 years 11 months ago
A robust front page detection algorithm for large periodical collections
Large-scale digitization projects aimed at periodicals often have as input streams of completely unlabeled document images. In such situations, the results produced by the automat...
Iuliu Vasile Konya, Christoph Seibert, Sebastian G...
JASIS
1998
13 years 4 months ago
Speech Recognition for a Digital Video Library
The standard method for making the full content of audio and video material searchable is to annotate it with human-generated metadata that describes the content in a way that...
Michael J. Witbrock, Alexander G. Hauptmann