Sciweavers

3090 search results - page 154 / 618
» Document Processing with LinkIT
Sort
View
EXTREME
2004
ACM
15 years 1 months ago
Interpretation Beyond Markup
The meaning conveyed by documents and their markup often goes well beyond what can be inferred from the markup alone. It often depends on context, so that to interpret document ma...
David Dubin, David J. Birnbaum
IR
2007
14 years 10 months ago
Regularizing query-based retrieval scores
In information retrieval, the cluster hypothesis states: closely related documents tend to be relevant to the same request. We exploit this hypothesis directly by adjusting queryb...
Fernando Diaz
ISMIS
2005
Springer
15 years 3 months ago
Identifying Content Blocks from Web Documents
Intelligent information processing systems, such as digital libraries or search engines index web-pages according to their informative content. However, web-pages contain several n...
Sandip Debnath, Prasenjit Mitra, C. Lee Giles
CIKM
2003
Springer
15 years 3 months ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
ICIP
2000
IEEE
15 years 2 months ago
Hough Technique for Bar Charts Detection and Recognition in Document Images
Charts are common graphic representation for scientific data in technical and business papers. We present a robust system for detecting and recognizing bar charts. The system incl...
Yan Ping Zhou, Chew Lim Tan