Accurate web page classification often depends crucially on information gained from neighboring pages in the local web graph. Prior work has exploited the class labels of nearby p...
Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...
A web media agent is presented, which can make a user's web surfing experience more productive. Once the user visits a web page, semantic descriptions of the media objects on...
Zheng Chen, Liu Wenyin, Rui Yang, Mingjing Li, Hon...
The wealth of information available on the web makes it an attractive resource for seeking quick answers. While web-based question answering becomes an emerging topic in recent ye...
This paper presents results of an extensive long-term clickstream study of Web browser usage. Focusing on character and challenges of page revisitation, previous findings from sev...
Hartmut Obendorf, Harald Weinreich, Eelco Herder, ...