Search Sciweavers | Sciweavers

» Adaptive record extraction from web pages

WWW
2004
ACM

156 views, Internet Technology

Testbed for information extraction from deep web

16 years 2 months ago

Download research.microsoft.com

Search results generated by searchable databases are served dynamically and far larger than the static documents on the Web. These results pages have been referred to as the Deep ...

Yasuhiro Yamada, Nick Craswell, Tetsuya Nakatoh, S...

WWW
2007
ACM

224 views, Internet Technology

EPCI: extracting potentially copyright infringement texts from the web

16 years 2 months ago

Download www2007.org

In this paper, we propose a new system extracting potentially copyright infringement texts from the Web, called EPCI. EPCI extracts them in the following way: (1) generating a set...

Takashi Tashiro, Takanori Ueda, Taisuke Hori, Yu H...

WWW
2005
ACM

173 views, Internet Technology

Extracting semantic structure of web documents using content and visual information

16 years 2 months ago

Download www2005.org

This work aims to provide a page segmentation algorithm which uses both visual and content information to extract the semantic structure of a web page. The visual information is u...

Rupesh R. Mehta, Pabitra Mitra, Harish Karnick

HT
2003
ACM

136 views, Internet Technology

Extracting evolution of web communities from a series of web archives

15 years 7 months ago

Download www.ht03.org

Recent advances in storage technology make it possible to store a series of large Web archives. It is now an exciting challenge for us to observe evolution of the Web. In this pap...

Masashi Toyoda, Masaru Kitsuregawa

WWW
2006
ACM

158 views, Internet Technology

Finding advertising keywords on web pages

16 years 2 months ago

Download www2006.org

A large and growing number of web pages display contextual advertising based on keywords automatically extracted from the text of the page, and this is a substantial source of rev...

Wen-tau Yih, Joshua Goodman, Vitor R. Carvalho
