This paper presents a method for finding a specification page on the web for a given object (e.g., "Titanic") and its class label (e.g., "film"). A specificati...
What makes template content on the Web so special that we need to remove it? In this paper I present a large-scale aggregate analysis of textual Web content, corroborating statist...
The Digital Agora is an information resource that supports understanding and analysis of complex problems in the social sciences. Large amounts of data are available from many dif...
Carolyn R. Watters, Michael A. Shepherd, Cynthia A...
Poster. We highlight the latest developments in the Argos project. First, we describe our approach to automatic workflow composition. Second, we discuss the validation of the fre...
Web spam, which refers to any deliberate action that gives selected web pages unjustifiably favorable relevance or importance, is one of the major obstacles to high quality ...