Search Sciweavers | Sciweavers

80 search results - page 1 / 16

» Extracting context to improve accuracy for HTML content extr...

click to vote

WWW
2005
ACM

150views Internet Technology» more WWW 2005»

Extracting context to improve accuracy for HTML content extraction

14 years 10 months ago

Download www1.cs.columbia.edu

Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...

Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo

claim paper

Read More »

click to vote

WWW
2005
ACM

188views Internet Technology» more WWW 2005»

Hybrid semantic tagging for information extraction

14 years 10 months ago

Download www.www2005.org

The semantic web is expected to have an impact at least as big as that of the existing HTML based web, if not greater. However, the challenge lays in creating this semantic web an...

Ronen Feldman, Binyamin Rosenfeld, Moshe Fresko, B...

claim paper

Read More »

click to vote

INFORMATICALT
2007

164views more INFORMATICALT 2007»

Extracting Personalised Ontology from Data-Intensive Web Application: an HTML Forms-Based Reverse Engineering Approach

13 years 9 months ago

Download www.mii.lt

The advance of the Web has signiﬁcantly and rapidly changed the way of information organization, sharing and distribution. The next generation of the web, the semantic web, seeks...

Sidi Mohamed Benslimane, Mimoun Malki, Mustapha Ka...

claim paper

Read More »

click to vote

SIGIR
2005
ACM

156views Information Technology» more SIGIR 2005»

Title extraction from bodies of HTML documents and its application to web page retrieval

14 years 3 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...

claim paper

Read More »

click to vote

IPM
2007

149views more IPM 2007»

Web page title extraction and its application

13 years 9 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields...

Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shu...

claim paper

Read More »

« Prev « First page 1 / 16 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers