Sciweavers

440 search results - page 1 / 88
» Information Space Based on HTML Structure
Sort
View
TREC
2000
13 years 6 months ago
Information Space Based on HTML Structure
The main goal for the Information Space system for TREC9 was early precision. To facilitate this, an emphasis was placed on seeking matches from only the TITLE, H1, H2 and H3 tags...
Gregory B. Newby
WWW
2006
ACM
14 years 5 months ago
HTML2RSS: automatic generation of RSS feed based on structure analysis of HTML document
We present a system to automatically generate RSS feeds from HTML documents that consist of time-series items with date expressions, e.g., archives of weblogs, BBSs, chats, mailin...
Tomoyuki Nanno, Manabu Okumura
AAAI
1997
13 years 6 months ago
Template-Based Information Mining from HTML Documents
Tools for mining information from data can create added value for the Internet. As the majority of electronic documents available over the network are in unstructured textual form...
Jane Yung-jen Hsu, Wen-tau Yih
INFORMATICALT
2007
164views more  INFORMATICALT 2007»
13 years 4 months ago
Extracting Personalised Ontology from Data-Intensive Web Application: an HTML Forms-Based Reverse Engineering Approach
The advance of the Web has significantly and rapidly changed the way of information organization, sharing and distribution. The next generation of the web, the semantic web, seeks...
Sidi Mohamed Benslimane, Mimoun Malki, Mustapha Ka...
AWIC
2005
Springer
13 years 10 months ago
Tuples Extraction from HTML Using Logic Wrappers and Inductive Logic Programming
This paper presents an approach for applying inductive logic programming to information extraction from HTML documents structured as unranked ordered trees. We consider information...
Costin Badica, Amelia Badica, Elvira Popescu