Sciweavers

482 search results - page 60 / 97
» Content-Based Retrieval in Digital Libraries
Sort
View
JCDL
2006
ACM
167views Education» more  JCDL 2006»
15 years 5 months ago
Combining DOM tree and geometric layout analysis for online medical journal article segmentation
We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...
Jie Zou, Daniel X. Le, George R. Thoma
EDBTW
2004
Springer
15 years 5 months ago
Relevance Feedback in XML Retrieval
Highly heterogeneous XML data collections that do not have a global schema, as arising, for example, in federations of digital libraries or scientific data repositories, cannot be...
Hanglin Pan
ECIR
2010
Springer
14 years 12 months ago
Analyzing Information Retrieval Methods to Recover Broken Web Links
In this work we compare different techniques to automatically find candidate web pages to substitute broken links. We extract information from the anchor text, the content of the p...
Juan Martinez-Romo, Lourdes Araujo
CIKM
1999
Springer
15 years 4 months ago
Indexing and Retrieval of Scientific Literature
The web hasgreatly improved accessto scientific literature. However, scientific articles on the web are largely disorganized, with research articles being spreadacrossarchive site...
Steve Lawrence, Kurt D. Bollacker, C. Lee Giles
SIGIR
2004
ACM
15 years 5 months ago
A search engine for imaged documents in PDF files
Large quantities of documents in the Internet and digital libraries are simply scanned and archived in image format, many of which are packed in PDF files. The word search tool pr...
Yue Lu, Li Zhang, Chew Lim Tan