Sciweavers

WWW
2010
ACM

Entity relation discovery from web tables and links

13 years 11 months ago
Entity relation discovery from web tables and links
The World-Wide Web consists not only of a huge number of unstructured texts, but also a vast amount of valuable structured data. Web tables [2] are a typical type of structured information that are pervasive on the web, and Web-scale methods that automatically extract web tables have been studied extensively [1]. Many powerful systems (e.g., OCTOPUS [4], Mesa [3]) use extracted web tables as a fundamental component. In the database vernacular, a table is defined as a set of tuples which have the same attributes. Similarly, a web table is defined as a set of rows (corresponding to database tuples) which have the same column headers (corresponding to database attributes). Therefore, to extract a web table is to extract a relation on the web. In databases, tables often contain foreign keys which refer to other tables. Therefore, it follows that hyperlinks inside a web table sometimes function as foreign keys to other relations whose tuples are contained in the hyperlink’s target page...
Cindy Xide Lin, Bo Zhao, Tim Weninger, Jiawei Han,
Added 14 May 2010
Updated 14 May 2010
Type Conference
Year 2010
Where WWW
Authors Cindy Xide Lin, Bo Zhao, Tim Weninger, Jiawei Han, Bing Liu
Comments (0)