Search Sciweavers | Sciweavers

193 search results - page 1 / 39

» Automatic Discovery of Semantic Structures in HTML Documents

131

click to vote

ICDAR
2003
IEEE

143views Document Analysis» more ICDAR 2003»

Automatic Discovery of Semantic Structures in HTML Documents

15 years 10 months ago

Download www.cs.sunysb.edu

Template-driven HTML documents posses an implicit, ﬁxed schema denoting concepts and their relationships in a hierarchical fashion. Discovering this schema remains a relatively ...

Saikat Mukherjee, Guizhen Yang, Wenfang Tan, I. V....

claim paper

Read More »

139

click to vote

FLAIRS
2007

208views Artificial Intelligence» more FLAIRS 2007»

Contextual Concept Discovery Algorithm

15 years 7 months ago

Download www.aaai.org

In this paper, we focus on the ontological concept extraction and evaluation process from HTML documents. In order to improve this process, we propose an unsupervised hierarchical...

Lobna Karoui, Marie-Aude Aufaure, Nacéra Be...

claim paper

Read More »

162

click to vote

WWW
2006
ACM

189views Internet Technology» more WWW 2006»

HTML2RSS: automatic generation of RSS feed based on structure analysis of HTML document

16 years 6 months ago

Download www2006.org

We present a system to automatically generate RSS feeds from HTML documents that consist of time-series items with date expressions, e.g., archives of weblogs, BBSs, chats, mailin...

Tomoyuki Nanno, Manabu Okumura

claim paper

Read More »

147

click to vote

AAAI
1997

162views Intelligent Agents» more AAAI 1997»

Template-Based Information Mining from HTML Documents

15 years 6 months ago

Download research.microsoft.com

Tools for mining information from data can create added value for the Internet. As the majority of electronic documents available over the network are in unstructured textual form...

Jane Yung-jen Hsu, Wen-tau Yih

claim paper

Read More »

143

click to vote

WWW
2004
ACM

134views Internet Technology» more WWW 2004»

Hearsay: enabling audio browsing on hypertext content

16 years 6 months ago

Download www.iw3c2.org

In this paper we present HearSay, a system for browsing hypertext Web documents via audio. The HearSay system is based on our novel approach to automatically creating audio browsa...

I. V. Ramakrishnan, Amanda Stent, Guizhen Yang

claim paper

Read More »

« Prev « First page 1 / 39 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers