Sciweavers

68 search results - page 12 / 14
» Modeling Archival Repositories for Digital Libraries
Sort
View
DL
1999
Springer
181views Digital Library» more  DL 1999»
15 years 1 months ago
Quality of OCR for Degraded Text Images
Commercial OCR packages work best with highquality scanned images. They often produce poor results when the image is degraded, either because the original itself was poor quality,...
Roger T. Hartley, Kathleen Crumpton
66
Voted
SIGIR
2004
ACM
15 years 3 months ago
A search engine for imaged documents in PDF files
Large quantities of documents in the Internet and digital libraries are simply scanned and archived in image format, many of which are packed in PDF files. The word search tool pr...
Yue Lu, Li Zhang, Chew Lim Tan
JCDL
2006
ACM
167views Education» more  JCDL 2006»
15 years 3 months ago
Combining DOM tree and geometric layout analysis for online medical journal article segmentation
We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...
Jie Zou, Daniel X. Le, George R. Thoma
BTW
2007
Springer
152views Database» more  BTW 2007»
15 years 3 months ago
Armada: a Reference Model for an Evolving Database System
Abstract: The data on the web, in digital libraries, in scientific repositories, etc. continues to grow at an increasing rate. Distribution is a key solution to overcome this data...
Fabian Groffen, Martin L. Kersten, Stefan Manegold
ERCIMDL
2010
Springer
163views Education» more  ERCIMDL 2010»
14 years 10 months ago
Determining Time of Queries for Re-ranking Search Results
Abstract. Recent work on analyzing query logs shows that a significant fraction of queries are temporal, i.e., relevancy is dependent on time, and temporal queries play an importan...
Nattiya Kanhabua, Kjetil Nørvåg