Sciweavers

PAMI
2007
185views more  PAMI 2007»
13 years 4 months ago
Restoring 2D Content from Distorted Documents
—This paper presents a framework to restore the 2D content printed on documents in the presence of geometric distortion and nonuniform illumination. Compared with text-based docu...
Michael S. Brown, Mingxuan Sun, Ruigang Yang, Lin ...
CORR
2010
Springer
43views Education» more  CORR 2010»
13 years 4 months ago
Vcache: Caching Dynamic Documents
---- The traditional web caching is currently limited to static documents only. A page generated on the fly from a server side script may have different contents on different acces...
Vipul Goyal, Sugata Sanyal, Dharma P. Agrawal
TKDE
2002
121views more  TKDE 2002»
13 years 4 months ago
ACIRD: Intelligent Internet Document Organization and Retrieval
This paper presents an intelligent Internet information system, Automatic Classifier for the Internet Resource Discovery (ACIRD), which uses machine learning techniques to organiz...
Shian-Hua Lin, Meng Chang Chen, Jan-Ming Ho, Yueh-...
TC
2002
13 years 4 months ago
Dynamically Selecting Optimal Distribution Strategies for Web Documents
To improve the scalability of the Web it is common practice to apply caching and replication techniques. Numerous strategies for placing and maintaining multiple copies of Web doc...
Guillaume Pierre, Maarten van Steen, Andrew S. Tan...
SIGIR
2002
ACM
13 years 4 months ago
Analysis of papers from twenty-five years of SIGIR conferences: what have we been doing for the last quarter of a century?
mes, abstracts and year of publication of all 853 papers published.1 We then applied Porter stemming and stopword removal to this text, represented terms from the elds with twice t...
Alan F. Smeaton, Gary Keogh, Cathal Gurrin, Kieran...
SIGIR
2002
ACM
13 years 4 months ago
Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering
A novel method for simultaneous keyphrase extraction and generic text summarization is proposed by modeling text documents as weighted undirected and weighted bipartite graphs. Sp...
Hongyuan Zha
SIGIR
2002
ACM
13 years 4 months ago
Liberal relevance criteria of TREC -: counting on negligible documents?
Most test collections (like TREC and CLEF) for experimental research in information retrieval apply binary relevance assessments. This paper introduces a four-point relevance scal...
Eero Sormunen
PAMI
2002
94views more  PAMI 2002»
13 years 4 months ago
Imaged Document Text Retrieval Without OCR
: We propose a method for text retrieval from document images without the use of OCR. Documents are segmented into character objects. Image features, namely the Vertical Traverse D...
Chew Lim Tan, Weihua Huang, Zhaohui Yu, Yi Xu
PAMI
2002
69views more  PAMI 2002»
13 years 4 months ago
Restoration of Archival Documents Using a Wavelet Technique
Chew Lim Tan, Ruini Cao, Peiyi Shen
MTA
1998
157views more  MTA 1998»
13 years 4 months ago
Brahma: Browsing and Retrieval Architecture for Hierarchical Multimedia Annotation
Traditional browsing of large multimedia documents (e.g., video, audio) is primarily sequential. In the absence of an index structure browsing and searching for relevant informatio...
Asit Dan, Dinkar Sitaram, Junehwa Song