Sciweavers

926 search results - page 2 / 186
» Improving HTML Compression
Sort
View
USITS
1997
13 years 6 months ago
Using the Structure of HTML Documents to Improve Retrieval
Michal Cutler, Yungming Shih, Weiyi Meng
WWW
2004
ACM
14 years 5 months ago
Optimization of html automatically generated by wysiwyg programs
Automatically generated HTML, as produced by WYSIWYG programs, typically contains much repetitive and unnecessary markup. This paper identifies aspects of such HTML that may be al...
Jacqueline Spiesser, Les Kitchen
FINTAL
2006
13 years 8 months ago
Evaluation of Alignment Methods for HTML Parallel Text
The Internet constitutes a potential huge store of parallel text that may be collected to be exploited by many applications such as multilingual information retrieval, machine tran...
Enrique Sánchez Villamil, Susana Santos-Ant...
IFIP12
2004
13 years 6 months ago
Impact on Performance of Hypertext Classification of Selective Rich HTML Capture
: Hypertext categorization is the automatic classification of web documents into predefined classes. It poses new challenges for automatic categorization because of the rich inform...
Houda Benbrahim, Max Bramer
SIGIR
2005
ACM
13 years 10 months ago
Title extraction from bodies of HTML documents and its application to web page retrieval
This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...
Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...