Sciweavers

8316 search results - page 203 / 1664
» Web Document Modeling
Sort
View
DRR
2003
15 years 5 months ago
Information retrieval for OCR documents: a content-based probabilistic correction model
The difficulty with information retrieval for OCR documents lies in the fact that OCR documents comprise of a significant amount of erroneous words and unfortunately most informat...
Rong Jin, ChengXiang Zhai, Alexander G. Hauptmann
CVPR
2005
IEEE
16 years 6 months ago
3D Geometric and Optical Modeling of Warped Document Images from Scanners
When one scans a document page from a thick bound volume, the curvature of the page to be scanned results in two kinds of distortion in the scanned document images: i) shade along...
Li Zhang, Zheng Zhang 0003, Chew Lim Tan, Tao Xia
CIKM
2008
Springer
15 years 6 months ago
Modeling document features for expert finding
We argue that expert finding is sensitive to multiple document features in an organization, and therefore, can benefit from the incorporation of these document features. We propos...
Jianhan Zhu, Dawei Song, Stefan M. Rüger, Xia...
IJCAI
2007
15 years 5 months ago
Semantic Smoothing of Document Models for Agglomerative Clustering
In this paper, we argue that the agglomerative clustering with vector cosine similarity measure performs poorly due to two reasons. First, the nearest neighbors of a document belo...
Xiaohua Zhou, Xiaodan Zhang, Xiaohua Hu
DEXAW
2007
IEEE
97views Database» more  DEXAW 2007»
15 years 10 months ago
Situated Multimodal Documents
—The choices made by user in processing a set of documents is related, in a broad sense, to the sum of influences coming from the documents in the user situation, which does not...
Augusto Celentano, Fabio Pittarello