Sciweavers

WWW
2010
ACM
13 years 11 months ago
Beyond position bias: examining result attractiveness as a source of presentation bias in clickthrough data
Leveraging clickthrough data has become a popular approach for evaluating and optimizing information retrieval systems. Although data is plentiful, one must take care when interpr...
Yisong Yue, Rajan Patel, Hein Roehrig
WWW
2010
ACM
13 years 11 months ago
Clustering query refinements by user intent
Eldar Sadikov, Jayant Madhavan, Lu Wang, Alon Y. H...
WWW
2010
ACM
13 years 11 months ago
Actively predicting diverse search intent from user browsing behaviors
This paper is concerned with actively predicting search intent from user browsing behavior data. In recent years, great attention has been paid to predicting user search intent. H...
Zhicong Cheng, Bin Gao, Tie-Yan Liu
WWW
2010
ACM
13 years 11 months ago
Identifying spam link generators for monitoring emerging web spam
In this paper, we address the question of how we can identify hosts that will generate links to web spam. Detecting such spam link generators is important because almost all new s...
Young-joo Chung, Masashi Toyoda, Masaru Kitsuregaw...
WWW
2010
ACM
13 years 11 months ago
Detecting Wikipedia vandalism with active learning and statistical language models
Si-Chi Chin, W. Nick Street, Padmini Srinivasan, D...
WWW
2010
ACM
13 years 11 months ago
SpotRank: a robust voting system for social news websites
In a social news website people share content they found on the web, called news, then vote for those they like the most. Voting for a news is then considered as a recommendation,...
Thomas Largillier, Guillaume Peyronnet, Sylvain Pe...
WWW
2010
ACM
13 years 11 months ago
What is disputed on the web?
We present a method for automatically acquiring of a corpus of disputed claims from the web. We consider a factual claim to be disputed if a page on the web suggests both that the...
Rob Ennals, Dan Byler, John Mark Agosta, Barbara R...
WWW
2010
ACM
13 years 11 months ago
New-web search with microblog annotations
Web search engines discover indexable documents by recursively ‘crawling’ from a seed URL. Their rankings take into account link popularity. While this works well, it introduc...
Tom Rowlands, David Hawking, Ramesh Sankaranarayan...
WWW
2010
ACM
13 years 11 months ago
Analyzing content-level properties of the web adversphere
Advertising has become an integral and inseparable part of the World Wide Web. However, neither public auditing nor monitoring mechanisms still exist in this emerging area. In thi...
Yong Wang, Daniel Burgener, Aleksandar Kuzmanovic,...
WWW
2010
ACM
13 years 11 months ago
LCA-based selection for XML document collections
In this paper, we address the problem of database selection for XML document collections, that is, given a set of collections and a user query, how to rank the collections based o...
Georgia Koloniari, Evaggelia Pitoura