Sciweavers

WWW
2006
ACM

HTML2RSS: automatic generation of RSS feed based on structure analysis of HTML document

14 years 5 months ago
HTML2RSS: automatic generation of RSS feed based on structure analysis of HTML document
We present a system to automatically generate RSS feeds from HTML documents that consist of time-series items with date expressions, e.g., archives of weblogs, BBSs, chats, mailing lists, site update descriptions, and event announcements. Our system extracts date expressions, performs structure analysis of a HTML document, and detects or generates titles from the document. Categories and Subject Descriptors H.5.4 [Information Systems]: Hypertext/Hypermedia; H.3.5 [Information Systems]: Online Information Services General Terms Management Keywords RSS, Atom, feed, document analysis, syndication
Tomoyuki Nanno, Manabu Okumura
Added 22 Nov 2009
Updated 22 Nov 2009
Type Conference
Year 2006
Where WWW
Authors Tomoyuki Nanno, Manabu Okumura
Comments (0)