Sciweavers

IDEAS
2002
IEEE

Integrating HTML Tables Using Semantic Hierarchies And Meta-Data Sets

13 years 9 months ago
Integrating HTML Tables Using Semantic Hierarchies And Meta-Data Sets
As the Internet is a global network, there is a demand on accessing closely related data without browsing through di erent Web documents. A signi cant amount of these data are presented in HTML documents that consist of markups and data contents. Since data contents of HTML documents are intervened by markups, it is not trivial to integrate and provide a uni ed view of closely related data in di erent HTML documents. In this paper, we present an approach for integrating semantically related data in any HTML tables that belong to a particular domain of interest (DOI), such as house/apartment rental and car advertisement, by using the semantic hierarchies generated from the tables and the prede ned meta-data sets that indicate related column names in DOI. In our integration approach, we capture each data source as semi-structured data, called semantic hierarchy, and the end result of integrating di erent HTML tables of a particular domain of interest is a uni ed view of data in the tabl...
Seung Jin Lim, Yiu-Kai Ng, Xiaochun Yang
Added 15 Jul 2010
Updated 15 Jul 2010
Type Conference
Year 2002
Where IDEAS
Authors Seung Jin Lim, Yiu-Kai Ng, Xiaochun Yang
Comments (0)