Sciweavers

1583 search results - page 221 / 317
» Document Image Retrieval using Signatures as Queries
ICDAR
1997
IEEE
15 years 9 months ago
Representing OCRed documents in HTML
OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...
Tao Hong, Sargur N. Srihari
CBMS
2006
IEEE
15 years 11 months ago
Biomedical Ontology MeSH Improves Document Clustering Qualify on MEDLINE Articles: A Comparison Study
Document clustering has been used for better document retrieval, document browsing, and text mining. In this paper, we investigate if biomedical ontology MeSH improves the cluster...
Illhoi Yoo, Xiaohua Hu
SIGMOD
2008
ACM
16 years 5 months ago
Cost-based variable-length-gram selection for string collections to support approximate queries efficiently
Approximate queries on a collection of strings are important in many applications such as record linkage, spell checking, and Web search, where inconsistencies and errors exist in...
Xiaochun Yang, Bin Wang, Chen Li
ECIR
2007
Springer
15 years 6 months ago
Entropy-Based Authorship Search in Large Document Collections
The purpose of authorship search is to identify documents written by a particular author or in a particular style in large document collections. Standard search engines match docum...
Ying Zhao, Justin Zobel
HICSS
2003
IEEE
15 years 10 months ago
Storage Model for CDA Documents
The Health Level 7 Clinical Document Architecture (CDA) is an XML-based document markup standard that specifies the hierarchical structure and semantics of “clinical documents” ...
Zheng Liang, Peter Bodorik, Michael Shepherd