Sciweavers

368 search results - page 1 / 74
» Template-Based Information Mining from HTML Documents
Sort
View
AAAI
1997
13 years 6 months ago
Template-Based Information Mining from HTML Documents
Tools for mining information from data can create added value for the Internet. As the majority of electronic documents available over the network are in unstructured textual form...
Jane Yung-jen Hsu, Wen-tau Yih
ACMICEC
2006
ACM
141views ECommerce» more  ACMICEC 2006»
13 years 10 months ago
From HTML documents to web tables and rules
We present a browser-extending Semantic Web extraction system that maps HTML documents to tables and, where possible, to rules. First, the basic data extractor ViPER distills and ...
Kai Simon, Georg Lausen, Harold Boley
IJCAI
2003
13 years 6 months ago
Information Extraction from Tree Documents by Learning Subtree Delimiters
Information extraction from HTML pages has been conventionally treated as plain text documents extended with HTML tags. However, the growing maturity and correct usage of HTML/XHT...
Boris Chidlovskii
WWW
2006
ACM
14 years 5 months ago
HTML2RSS: automatic generation of RSS feed based on structure analysis of HTML document
We present a system to automatically generate RSS feeds from HTML documents that consist of time-series items with date expressions, e.g., archives of weblogs, BBSs, chats, mailin...
Tomoyuki Nanno, Manabu Okumura
20
Voted
ESWS
2010
Springer
13 years 9 months ago
LESS - Template-Based Syndication and Presentation of Linked Data
Recently, the publishing of structured, semantic information as linked data has gained quite some momentum. For ordinary users on the Internet, however, this information is not yet...
Sören Auer, Raphael Doehring, Sebastian Dietz...