This paper is concerned with the problem of structured data extraction from Web pages. The objective of the research is to automatically segment data records in a page, extract da...
We discuss learning a profile of user interests for recommending information sources such as Web pages or news articles. We describe the types of information available to determin...
The navigation routing code of a web application is the part of the code involved in routing a request from a web page through the appropriate components on the server, typically e...
The problem of measuring similarity between web pages arises in many important Web applications, such as search engines and Web directories. In this paper, we propose a novel neig...
A large number of web sites publish pages containing structured information about recognizable concepts, but these data are only partially used by current applications. Although s...
Paolo Papotti, Valter Crescenzi, Paolo Merialdo, M...