Search Sciweavers | Sciweavers

1161 search results - page 35 / 233

» Using web structure for classifying and describing web pages

146

click to vote

WWW
2009
ACM

131views Internet Technology» more WWW 2009»

Purely URL-based topic classification

16 years 6 months ago

Download www2009.org

Given only the URL of a web page, can we identify its topic? This is the question that we examine in this paper. Usually, web pages are classified using their content [7], but a U...

Eda Baykan, Monika Rauch Henzinger, Ludmila Marian...

claim paper

Read More »

134

click to vote

HICSS
2008
IEEE

175views Biometrics» more HICSS 2008»

An Examination of Genre Attributes for Web Page Classification

15 years 11 months ago

Download www.hicss.hawaii.edu

In this paper, we describe a set of experiments to examine the effect of various attributes of web genre on the automatic identification of the genre of web pages. Four different ...

Lei Dong, Carolyn R. Watters, Jack Duffy, Michael ...

claim paper

Read More »

142

click to vote

SIGKDD
2010

111views more SIGKDD 2010»

Unexpected results in automatic list extraction on the web

15 years 3 days ago

Download www.sigkdd.org

The discovery and extraction of general lists on the Web continues to be an important problem facing the Web mining community. There have been numerous studies that claim to autom...

Tim Weninger, Fabio Fumarola, Rick Barber, Jiawei ...

claim paper

Read More »

163

click to vote

CEAS
2011
Springer

259views Internet Technology» more CEAS 2011»

Spam detection using web page content: a new battleground

14 years 5 months ago

Download homepages.dcc.ufmg.br

Traditional content-based e-mail spam ﬁltering takes into account content of e-mail messages and apply machine learning techniques to infer patterns that discriminate spams from...

Marco Túlio Ribeiro, Pedro Henrique Calais ...

claim paper

Read More »

144

click to vote

VLDB
2007
ACM

134views Database» more VLDB 2007»

Building Structured Web Community Portals: A Top-Down, Compositional, and Incremental Approach

15 years 11 months ago

Download pages.cs.wisc.edu

Structured community portals extract and integrate information from raw Web pages to present a uniﬁed view of entities and relationships in the community. In this paper we argue...

Pedro DeRose, Warren Shen, Fei Chen 0002, AnHai Do...

claim paper

Read More »

« Prev « First page 35 / 233 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers