HTML document | Sciweavers

205

IJMSO
2008

149views more IJMSO 2008»

Categorisation of web documents using extraction ontologies

15 years 7 months ago

: Automatically recognising which HTML documents on the Web contain items of interest for a user is non-trivial. As a step toward solving this problem, we propose an approach based...

Li Xu, David W. Embley

claim paper

Read More »

183

click to vote

CSREAEEE
2006

154views Business» more CSREAEEE 2006»

Structural Discovery of E-lessons

15 years 9 months ago

Download ww1.ucmss.com

An e-lesson is comprised of a "body" and a "view". The body is the actual content of the e-lesson and the assumption is that it is an html document. The view i...

Azita Bahrami

claim paper

Read More »

283

click to vote

ISEC
2001
Springer

180views ECommerce» more ISEC 2001»

i-Cube: A Tool-Set for the Dynamic Extraction and Integration of Web Data Content

16 years 17 hour ago

Download www.swen.uwaterloo.ca

Over the past decade the Internet has evolved into the largest public community in the world. It provides a wealth of data content and services in almost every field of science, t...

Frankie Poon, Kostas Kontogiannis

claim paper

Read More »

205

click to vote

ISMIS
2003
Springer

131views Artificial Intelligence» more ISMIS 2003»

MetaNews: An Information Agent for Gathering News Articles on the Web

16 years 23 days ago

Download www.cs.iastate.edu

This paper presents MetaNews, an information gathering agent for news articles on the Web. MetaNews reads HTML documents from online news sites and extracts article information fro...

Dae-Ki Kang, Joongmin Choi

claim paper

Read More »

212

click to vote

SIGIR
2005
ACM

156views Information Technology» more SIGIR 2005»

Title extraction from bodies of HTML documents and its application to web page retrieval

16 years 1 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...

claim paper

Read More »

213

click to vote

WWW
2006
ACM

189views Internet Technology» more WWW 2006»

HTML2RSS: automatic generation of RSS feed based on structure analysis of HTML document

16 years 8 months ago

Download www2006.org

We present a system to automatically generate RSS feeds from HTML documents that consist of time-series items with date expressions, e.g., archives of weblogs, BBSs, chats, mailin...

Tomoyuki Nanno, Manabu Okumura

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers