Sciweavers

» Using Information Extraction to Improve Document Retrieval
CIKM
2010
Springer
15 years 4 months ago
Document allocation policies for selective searching of distributed indexes
Indexes for large collections are often divided into shards that are distributed across multiple computers and searched in parallel to provide rapid interactive search. Typically,...
Anagha Kulkarni, Jamie Callan
SIGIR
2002
ACM
15 years 5 months ago
Two-stage language models for information retrieval
The optimal settings of retrieval parameters often depend on both the document collection and the query, and are usually found through empirical tuning. In this paper, we propose ...
ChengXiang Zhai, John D. Lafferty
ICAPR
2001
Springer
15 years 10 months ago
Character Extraction from Interfering Background - Analysis of Double-Sided Handwritten Archival Documents
The seeping of ink through the pages of certain double-sided handwritten documents after long periods of storage poses a serious problem to human readers or OCR systems. This pape...
Chew Lim Tan, Ruini Cao, Qian Wang, Peiyi Shen
HICSS
2007
IEEE
16 years 12 days ago
Essential Dimensions of Latent Semantic Indexing (LSI)
Latent Semantic Indexing (LSI) is commonly used to match queries to documents in information retrieval applications. LSI has been shown to improve retrieval performance for some, ...
April Kontostathis
ACL
2009
15 years 3 months ago
A Generative Blog Post Retrieval Model that Uses Query Expansion based on External Collections
User generated content is characterized by short, noisy documents, with many spelling errors and unexpected language usage. To bridge the vocabulary gap between the user's in...
Wouter Weerkamp, Krisztian Balog, Maarten de Rijke