Sciweavers

44 search results - page 3 / 9
» An XML Approach to Semantically Extract Data from HTML Table...
AAAI
1997
13 years 7 months ago
Template-Based Information Mining from HTML Documents
Tools for mining information from data can create added value for the Internet. As the majority of electronic documents available over the network are in unstructured textual form...
Jane Yung-jen Hsu, Wen-tau Yih
VLDB
2011
ACM
13 years 1 month ago
Harvesting relational tables from lists on the web
A large number of web pages contain data structured in the form of “lists”. Many such lists can be further split into multi-column tables, which can then be used in more seman...
Hazem Elmeleegy, Jayant Madhavan, Alon Y. Halevy
ESWS
2008
Springer
13 years 8 months ago
XSPARQL: Traveling between the XML and RDF Worlds - and Avoiding the XSLT Pilgrimage
With currently available tools and languages, translating between an existing XML format and RDF is a tedious and error-prone task. The importance of this problem is acknowledged b...
Waseem Akhtar, Jacek Kopecký, Thomas Krennw...
SIGMOD
2000
ACM
13 years 10 months ago
XTRACT: A System for Extracting Document Type Descriptors from XML Documents
XML is rapidly emerging as the new standard for data representation and exchange on the Web. An XML document can be accompanied by a Document Type Descriptor (DTD) which plays the...
Minos N. Garofalakis, Aristides Gionis, Rajeev Ras...
ICDAR
2003
IEEE
13 years 11 months ago
A Constraint-based Approach to Table Structure Derivation
This paper presents an approach to deriving an abstract geometric model of a table from a physical representation. The technique developed uses a graph of constraints between cells which ...
Matthew Hurst