Sciweavers

821 search results - page 83 / 165
» Retrieval from Document Image Collections
Sort
View
JCDL
2005
ACM
100views Education» more  JCDL 2005»
15 years 5 months ago
What's there and what's not?: focused crawling for missing documents in digital libraries
Some large scale topical digital libraries, such as CiteSeer, harvest online academic documents by crawling open-access archives, university and author homepages, and authors’ s...
Ziming Zhuang, Rohit Wagle, C. Lee Giles
LREC
2008
106views Education» more  LREC 2008»
15 years 1 months ago
Producing an Encyclopedic Dictionary using Patent Documents
Although the World Wide Web has of late become an important source to consult for the meaning of words, a number of technical terms related to high technology are not found on the...
Atsushi Fujii
CIKM
2006
Springer
15 years 3 months ago
Multi-evidence, multi-criteria, lazy associative document classification
We present a novel approach for classifying documents that combines different pieces of evidence (e.g., textual features of documents, links, and citations) transparently, through...
Adriano Veloso, Wagner Meira Jr., Marco Cristo, Ma...
DEXA
2006
Springer
193views Database» more  DEXA 2006»
15 years 3 months ago
Understanding and Enhancing the Folding-In Method in Latent Semantic Indexing
Abstract. Latent Semantic Indexing(LSI) has been proved to be effective to capture the semantic structure of document collections. It is widely used in content-based text retrieval...
Xiang Wang 0002, Xiaoming Jin
JCDL
2004
ACM
128views Education» more  JCDL 2004»
15 years 5 months ago
Panorama: extending digital libraries with topical crawlers
A large amount of research, technical and professional documents are available today in digital formats. Digital libraries are created to facilitate search and retrieval of inform...
Gautam Pant, Kostas Tsioutsiouliklis, Judy Johnson...