
APWEB
2003
Springer


Extracting Content Structure for Web Pages Based on Visual Representation



Abstract. This paper proposes a new web content structure based on visual representation. Many web applications, such as information retrieval, information extraction, and automatic page adaptation, can benefit from this structure. The paper presents an automatic, top-down, tag-tree-independent approach to detecting web content structure that simulates how a user understands web layout structure through visual perception. Compared with existing techniques, the approach is independent of the underlying document representation, such as HTML, and works well even when the HTML structure differs greatly from the layout structure. Experiments show satisfactory results.
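The abstract gives no algorithmic detail, so the following is only an illustrative sketch of what a top-down, tag-tree-independent segmentation can look like. It is not the authors' method. It substitutes a simple recursive XY-cut: rendered elements are treated as bounding boxes, and a region is split repeatedly at its widest empty gap. The names `Box`, `Node`, `widest_gap`, `segment`, and `outline`, the `min_gap` threshold, and the sample coordinates are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Box:
    """Bounding box of one rendered element (hypothetical input format)."""
    x0: float
    y0: float
    x1: float
    y1: float
    label: str = ""


@dataclass
class Node:
    """One region of the page and the sub-regions it was split into."""
    boxes: List[Box]
    children: List["Node"] = field(default_factory=list)


def widest_gap(boxes: List[Box], axis: str, min_gap: float) -> Optional[Tuple[float, float]]:
    """Return (gap_width, cut_position) for the widest empty gap along `axis`."""
    lo, hi = ("y0", "y1") if axis == "y" else ("x0", "x1")
    spans = sorted((getattr(b, lo), getattr(b, hi)) for b in boxes)
    best = None
    reach = spans[0][1]  # farthest extent covered by the spans seen so far
    for start, end in spans[1:]:
        gap = start - reach
        if gap >= min_gap and (best is None or gap > best[0]):
            best = (gap, reach + gap / 2)
        reach = max(reach, end)
    return best


def segment(boxes: List[Box], min_gap: float = 10.0) -> Node:
    """Split a region top-down at its widest gap until no gap of min_gap remains."""
    node = Node(boxes)
    if len(boxes) < 2:
        return node
    candidates = []
    for axis in ("y", "x"):
        found = widest_gap(boxes, axis, min_gap)
        if found is not None:
            candidates.append((found[0], axis, found[1]))
    if not candidates:
        return node  # visually coherent block: nothing separates its boxes
    _, axis, cut = max(candidates, key=lambda c: c[0])
    hi = "y1" if axis == "y" else "x1"
    # The cut lies inside an empty gap, so every box falls wholly on one side.
    before = [b for b in boxes if getattr(b, hi) <= cut]
    after = [b for b in boxes if getattr(b, hi) > cut]
    node.children = [segment(before, min_gap), segment(after, min_gap)]
    return node


def outline(node: Node):
    """Nested lists of labels mirroring the detected structure."""
    if not node.children:
        return [b.label for b in node.boxes]
    return [outline(child) for child in node.children]


# Hypothetical layout: a full-width header above a narrow menu and a main column.
page = [
    Box(0, 0, 800, 50, "header"),
    Box(0, 80, 150, 600, "nav"),
    Box(180, 80, 800, 600, "content"),
]
print(outline(segment(page)))  # [['header'], [['nav'], ['content']]]
```

The sketch uses geometry alone and never consults the markup, which is the property the abstract emphasizes. A practical system would need additional visual cues, such as fonts, colors, and separator lines.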

Deng Cai, Shipeng Yu, Ji-Rong Wen, Wei-Ying Ma


APWEB 2003 | Layout Structure | Web Content Structure | Web Layout Structure



Post Info

Added	06 Jul 2010
Updated	06 Jul 2010
Type	Conference
Year	2003
Where	APWEB
Authors	Deng Cai, Shipeng Yu, Ji-Rong Wen, Wei-Ying Ma
