Sciweavers

1319 search results - page 146 / 264
» Using the Structure of HTML Documents to Improve Retrieval
Sort
View
140
Voted
ECIR
2010
Springer
15 years 5 months ago
Text Clustering for Peer-to-Peer Networks with Probabilistic Guarantees
Text clustering is an established technique for improving quality in information retrieval, for both centralized and distributed environments. However, for highly distributed envir...
Odysseas Papapetrou, Wolf Siberski, Norbert Fuhr
121
Voted
SIGIR
2009
ACM
15 years 10 months ago
Reducing long queries using query quality predictors
Long queries frequently contain many extraneous terms that hinder retrieval of relevant documents. We present techniques to reduce long queries to more effective shorter ones tha...
Giridhar Kumaran, Vitor R. Carvalho
127
Voted
IPM
2007
95views more  IPM 2007»
15 years 3 months ago
Using structural contexts to compress semistructured text collections
We describe a compression model for semistructured documents, called Structural Contexts Model (SCM), which takes advantage of the context information usually implicit in the stru...
Joaquín Adiego, Gonzalo Navarro, Pablo de l...
114
Voted
CORR
2008
Springer
176views Education» more  CORR 2008»
15 years 3 months ago
An evaluation of Bradfordizing effects
The purpose of this paper is to apply and evaluate the bibliometric method Bradfordizing for information retrieval (IR) experiments. Bradfordizing is used for generating core docu...
Philipp Mayr
AIRWEB
2009
Springer
15 years 10 months ago
Looking into the past to better classify web spam
Web spamming techniques aim to achieve undeserved rankings in search results. Research has been widely conducted on identifying such spam and neutralizing its influence. However,...
Na Dai, Brian D. Davison, Xiaoguang Qi