Sciweavers

2524 search results - page 193 / 505
» Numerical document queries
Sort
View
DOCENG
2005
ACM
15 years 6 months ago
Managing syntactic variation in text retrieval
Information Retrieval systems are limited by the linguistic variation of language. The use of Natural Language Processing techniques to manage this problem has been studied for a ...
Jesús Vilares, Carlos Gómez-Rodr&iac...
EACL
2006
ACL Anthology
15 years 5 months ago
A Figure of Merit for the Evaluation of Web-Corpus Randomness
In this paper, we present an automated, quantitative, knowledge-poor method to evaluate the randomness of a collection of documents (corpus), with respect to a number of biased pa...
Massimiliano Ciaramita, Marco Baroni
TREC
2003
15 years 5 months ago
Experiments in TREC 2003 Genomics Track at NTT
500,000 PubMed abstracts. However, less than 50 documents are relevant for most queries. Applying scoring to all 500,000 abstracts would create a lot of noise. In the first step, ...
Hirotoshi Taira, Tomonori Izumitani, Tsutomu Hirao...
CORR
2010
Springer
116views Education» more  CORR 2010»
15 years 4 months ago
LiquidXML: Adaptive XML Content Redistribution
We propose to demonstrate LiquidXML, a platform for managing large corpora of XML documents in large-scale P2P networks. All LiquidXML peers may publish XML documents to be shared...
Jesús Camacho-Rodríguez, Asterios Ka...
IPM
2006
83views more  IPM 2006»
15 years 4 months ago
A risk minimization framework for information retrieval
This paper presents a probabilistic information retrieval framework in which the retrieval problem is formally treated as a statistical decision problem. In this framework, querie...
ChengXiang Zhai, John D. Lafferty