Search Sciweavers | Sciweavers

1319 search results - page 1 / 264

» Using the Structure of HTML Documents to Improve Retrieval

ICTAI
1999
IEEE

101 views | Artificial Intelligence

A New Study on Using HTML Structures to Improve Retrieval

15 years 9 months ago

Download www.cs.binghamton.edu

Locating useful information effectively from the World Wide Web (WWW) is of wide interest. This paper presents new results on a methodology of using the structures and hyperlinks ...

Michal Cutler, H. Deng, S. Maniccam, Weiyi Meng

USITS
1997

92 views | Operating System

Using the Structure of HTML Documents to Improve Retrieval

15 years 6 months ago

Download www.usenix.org

Michal Cutler, Yungming Shih, Weiyi Meng

SIGIR
2005
ACM

156 views | Information Technology

Title extraction from bodies of HTML documents and its application to web page retrieval

15 years 10 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...

AAAI
1997

162 views | Intelligent Agents

Template-Based Information Mining from HTML Documents

15 years 6 months ago

Download research.microsoft.com

Tools for mining information from data can create added value for the Internet. As the majority of electronic documents available over the network are in unstructured textual form...

Jane Yung-jen Hsu, Wen-tau Yih

WWW
2005
ACM

150 views | Internet Technology

Extracting context to improve accuracy for HTML content extraction

16 years 6 months ago

Download www1.cs.columbia.edu

Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...

Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo
