Search Sciweavers | Sciweavers

» Cleaning Web Pages for Effective Web Content Mining

WWW
2008
ACM

Internet Technology

Genealogical trees on the web: a search engine user perspective

This paper presents an extensive study about the evolution of textual content on the Web, which shows how some new pages are created from scratch while others are created using al...

Ricardo A. Baeza-Yates, Álvaro R. Pereira J...

KDD
2002
ACM

Data Mining

Discovering informative content blocks from Web documents

In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...

Shian-Hua Lin, Jan-Ming Ho

WWW
2004
ACM

Internet Technology

What's new on the web?: the evolution of the web from a search engine perspective

We seek to gain improved insight into how Web search engines should cope with the evolving Web, in an attempt to provide users with the most up-to-date results possible. For this ...

Alexandros Ntoulas, Junghoo Cho, Christopher Olsto...

WWW
2011
ACM

Internet Technology

HyLiEn: a hybrid approach to general list extraction on the web

We consider the problem of automatically extracting general lists from the web. Existing approaches are mostly dependent upon either the underlying HTML markup or the visual struc...

Fabio Fumarola, Tim Weninger, Rick Barber, Donato ...

KDD
2007
ACM

Data Mining

Cleaning disguised missing data: a heuristic approach

In some applications such as filling in a customer information form on the web, some missing values may not be explicitly represented as such, but instead appear as potentially va...

Ming Hua, Jian Pei
