Sciweavers

1319 search results - page 47 / 264
» Using the Structure of HTML Documents to Improve Retrieval
Sort
View
TREC
2004
15 years 4 months ago
HARD Track Overview in TREC 2004 - High Accuracy Retrieval from Documents
The HARD track of TREC 2004 aims to improve the accuracy of information retrieval through the use of three techniques: (1) query metadata that better describes the information nee...
James Allan
ECIR
2007
Springer
15 years 4 months ago
A Bayesian Approach for Learning Document Type Relevance
Retrieval accuracy can be improved by considering which document type should be filtered out and which should be ranked higher in the result list. Hence, document type can be used...
Peter C. K. Yeung, Stefan Büttcher, Charles L...
CACM
1998
110views more  CACM 1998»
15 years 3 months ago
Viewing WISs as Database Applications
abstraction for modeling these problems is to view the Web as a collection of (usually small and heterogeneous) databases, and to view programs that extract and process Web data au...
Gustavo O. Arocena, Alberto O. Mendelzon
WWW
2005
ACM
16 years 3 months ago
Thresher: automating the unwrapping of semantic content from the World Wide Web
We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...
Andrew Hogue, David R. Karger
ECIR
2010
Springer
15 years 4 months ago
Using the Quantum Probability Ranking Principle to Rank Interdependent Documents
A known limitation of the Probability Ranking Principle (PRP) is that it does not cater for dependence between documents. Recently, the Quantum Probability Ranking Principle (QPRP)...
Guido Zuccon, Leif Azzopardi