SIGIR
2009
ACM
13 years 11 months ago
Approximating true relevance distribution from a mixture model based on irrelevance data
Pseudo relevance feedback (PRF), which has been widely applied in IR, aims to derive a distribution from the top n pseudo relevant documents D. However, these documents are often ...
Peng Zhang, Yuexian Hou, Dawei Song
SIGIR
2009
ACM
13 years 11 months ago
Context transfer in search advertising
We define and study the process of context transfer in search advertising, which is the transition of a user from the context of Web search to the context of the landing page tha...
Hila Becker, Andrei Z. Broder, Evgeniy Gabrilovich...
SIGIR
2009
ACM
13 years 11 months ago
A graph-based approach to mining multilingual word associations from Wikipedia
In this paper, we propose a graph-based approach to constructing a multilingual association dictionary from Wikipedia, in which we exploit two kinds of links in Wikipedia articles...
Zheng Ye, Xiangji Huang, Hongfei Lin
SIGIR
2009
ACM
13 years 11 months ago
Protein identification as an information retrieval problem
We present the first interdisciplinary work on transforming a popular problem in proteomics, i.e. protein identification from tandem mass spectra, to an Information Retrieval (IR)...
Yiming Yang, Subramaniam Ganapathy, Abhay Harpale
SIGIR
2009
ACM
13 years 11 months ago
Two-stage query segmentation for information retrieval
Modeling term dependence has been shown to have a significant positive impact on retrieval. Current models, however, use sequential term dependencies, leading to an increased que...
Michael Bendersky, W. Bruce Croft, David A. Smith
SIGIR
2009
ACM
13 years 11 months ago
Spam filter evaluation with imprecise ground truth
When trained and evaluated on accurately labeled datasets, online email spam filters are remarkably effective, achieving error rates an order of magnitude better than classifie...
Gordon V. Cormack, Aleksander Kolcz
SIGIR
2009
ACM
13 years 11 months ago
Automatic video tagging using content redundancy
The analysis of the leading social video sharing platform YouTube reveals a high amount of redundancy, in the form of videos with overlapping or duplicated content. In this paper,...
Stefan Siersdorfer, José San Pedro, Mark Sa...
SIGIR
2009
ACM
13 years 11 months ago
The impact of crawl policy on web search effectiveness
Crawl selection policy has a direct influence on Web search effectiveness, because a useful page that is not selected for crawling will also be absent from search results. Yet th...
Dennis Fetterly, Nick Craswell, Vishwa Vinay
SIGIR
2009
ACM
13 years 11 months ago
A latent topic model for linked documents
Documents in many corpora, such as digital libraries and webpages, contain both content and link information. To explicitly consider the document relations represented by links, i...
Zhen Guo, Shenghuo Zhu, Yun Chi, Zhongfei Zhang, Y...