SIGIR
13 years 11 months ago
2004 ACM
Many museum and library archives are digitizing their large collections of handwritten historical manuscripts to enable public access to them. These collections are only available...
SIGIR
13 years 11 months ago
2004 ACM
We present a probabilistic model for a document corpus that combines many of the desirable features of previous models. The model is called “GaP” for Gamma-Poisson, the distri...
SIGIR
13 years 11 months ago
2004 ACM
Although text categorization is a burgeoning area of IR research, readily available test collections in this field are surprisingly scarce. We describe a methodology and system (...
SIGIR
13 years 11 months ago
2004 ACM
This paper investigates the level of metadata accuracy required for image filters to be valuable to users. Access to large digital image and video collections is hampered by ambig...
SIGIR
13 years 11 months ago
2004 ACM
We describe an evaluation of result set filtering techniques for providing ultra-high precision in the task of presenting related news for general web queries. In this task, the n...
|