138
click to vote
SIGIR
15 years 6 months ago
2004 ACM
We present a probabilistic model for a document corpus that combines many of the desirable features of previous models. The model is called “GaP” for Gamma-Poisson, the distri...
135
Voted
SIGIR
15 years 6 months ago
2004 ACM
Many museum and library archives are digitizing their large collections of handwritten historical manuscripts to enable public access to them. These collections are only available...
131
Voted
SIGIR
15 years 6 months ago
2004 ACM
The exponential growth of data demands scalable infrastructures capable of indexing and searching rich content such as text, music, and images. A promising direction is to combine...
128
Voted
SIGIR
15 years 6 months ago
2004 ACM
Document-centric XML collections contain text-rich documents, marked up with XML tags. The tags add lightweight semantics to the text. Querying such collections calls for a hybrid...
128
Voted
SIGIR
15 years 6 months ago
2004 ACM
This paper reports on and discusses a set of user experiments using the TREC 2003 Web interactive track protocol. The focus is on comparing humans and machine algorithms in terms ...
|