Search Sciweavers | Sciweavers

21 search results - page 1 / 5

» Title extraction from bodies of HTML documents and its appli...

click to vote

SIGIR
2005
ACM

156views Information Technology» more SIGIR 2005»

Title extraction from bodies of HTML documents and its application to web page retrieval

13 years 10 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...

claim paper

Read More »

click to vote

IPM
2007

149views more IPM 2007»

Web page title extraction and its application

13 years 4 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields...

Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shu...

claim paper

Read More »

click to vote

WWW
2005
ACM

150views Internet Technology» more WWW 2005»

Extracting context to improve accuracy for HTML content extraction

14 years 5 months ago

Download www1.cs.columbia.edu

Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...

Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo

claim paper

Read More »

click to vote

WWW
2003
ACM

130views Internet Technology» more WWW 2003»

DOM-based content extraction of HTML documents

14 years 5 months ago

Download www.psl.cs.columbia.edu

Web pages often contain clutter (such as pop-up ads, unnecessary images and extraneous links) around the body of an article that distracts a user from actual content. Extraction o...

Suhit Gupta, Gail E. Kaiser, David Neistadt, Peter...

claim paper

Read More »

click to vote

WWW
2011
ACM

316views Internet Technology» more WWW 2011»

Identifying primary content from web pages and its application to web search ranking

12 years 11 months ago

Download www.www2011india.com

Web pages are usually highly structured documents. In some documents, content with diﬀerent functionality is laid out in blocks, some merely supporting the main discourse. In ot...

Srinivas Vadrevu, Emre Velipasaoglu

claim paper

Read More »

« Prev « First page 1 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers