Sciweavers

W4A
2006
ACM

Transforming web pages to become standard-compliant through reverse engineering

13 years 9 months ago
Transforming web pages to become standard-compliant through reverse engineering
Developing Web pages following established standards can make the information more accessible, their rendering more efficient, and their processing by computer applications easier. Unfortunately, more than 95% of the existing Web pages today are not “valid” in that they do not follow some of the recommendations (standards) of the World Wide Web Consortium (W3C). Fixing any Web page to make it standardcompliant is a major undertaking. There is now an open-source tool called HTML Tidy which will attempt to fix the invalid HTML code automatically. However, Tidy often changes the Web page’s appearance after processing. It is not an effective tool to transform existing Web pages to make them standard-compliant. In this paper we report the design and implementation of PURE, a tool that cleans up an HTML document through reverse engineering. PURE starts with the rendering result of a given Web page and generates valid HTML code and CSS automatically to produce the same appearance. It i...
Benfeng Chen, Vincent Y. Shen
Added 14 Jun 2010
Updated 14 Jun 2010
Type Conference
Year 2006
Where W4A
Authors Benfeng Chen, Vincent Y. Shen
Comments (0)