105
click to vote
SIGIR
15 years 2 months ago
2004 ACM
Many museum and library archives are digitizing their large collections of handwritten historical manuscripts to enable public access to them. These collections are only available...
98
Voted
SIGIR
15 years 2 months ago
2004 ACM
We present a probabilistic model for a document corpus that combines many of the desirable features of previous models. The model is called “GaP” for Gamma-Poisson, the distri...
SIGIR
15 years 2 months ago
2004 ACM
Noun phrases in queries are identified and classified into four types: proper names, dictionary phrases, simple phrases and complex phrases. A document has a phrase if all content...
SIGIR
15 years 2 months ago
2004 ACM
The exponential growth of data demands scalable infrastructures capable of indexing and searching rich content such as text, music, and images. A promising direction is to combine...
SIGIR
15 years 2 months ago
2004 ACM
This paper reports on and discusses a set of user experiments using the TREC 2003 Web interactive track protocol. The focus is on comparing humans and machine algorithms in terms ...
|