Sciweavers

244 search results - page 25 / 49
» From HTML documents to web tables and rules
Sort
View
ELPUB
2000
ACM
15 years 4 months ago
XML: More Than an E-Publishing Language
XML is an SGML-based language designed for the interchange of documents with more flexible and powerful features than those provided by HTML. It can be considered as an intermedia...
Jaime Delgado, Ramon Martí, Xavier Perramon
CIKM
1998
Springer
15 years 4 months ago
Ontology-Based Extraction and Structuring of Information from Data-Rich Unstructured Documents
We present a new approach to extracting information from unstructured documents based on an application ontology that describes a domain of interest. Starting with such an ontolog...
David W. Embley, Douglas M. Campbell, Randy D. Smi...
ACL
2006
15 years 1 months ago
A Grammatical Approach to Understanding Textual Tables Using Two-Dimensional SCFGs
We present an elegant and extensible model that is capable of providing semantic interpretations for an unusually wide range of textual tables in documents. Unlike the few existin...
Dekai Wu, Ken Wing Kuen Lee
ICDM
2006
IEEE
164views Data Mining» more  ICDM 2006»
15 years 5 months ago
Unsupervised Learning of Tree Alignment Models for Information Extraction
We propose an algorithm for extracting fields from HTML search results. The output of the algorithm is a database table– a data structure that better lends itself to high-level...
Philip Zigoris, Damian Eads, Yi Zhang
HCI
2009
14 years 9 months ago
Using Activity Descriptions to Generate User Interfaces for ERP Software
Delivering tailor-made ERP software requires automation of screen and printed report creation to be cost effective. Screens generated directly from data structures tend to have poo...
Timothy O'Hear, Yassin Boudjenane