Sciweavers

1161 search results - page 35 / 233
» Using web structure for classifying and describing web pages
Sort
View
77
Voted
WWW
2009
ACM
15 years 10 months ago
Purely URL-based topic classification
Given only the URL of a web page, can we identify its topic? This is the question that we examine in this paper. Usually, web pages are classified using their content [7], but a U...
Eda Baykan, Monika Rauch Henzinger, Ludmila Marian...
HICSS
2008
IEEE
175views Biometrics» more  HICSS 2008»
15 years 4 months ago
An Examination of Genre Attributes for Web Page Classification
In this paper, we describe a set of experiments to examine the effect of various attributes of web genre on the automatic identification of the genre of web pages. Four different ...
Lei Dong, Carolyn R. Watters, Jack Duffy, Michael ...
SIGKDD
2010
111views more  SIGKDD 2010»
14 years 4 months ago
Unexpected results in automatic list extraction on the web
The discovery and extraction of general lists on the Web continues to be an important problem facing the Web mining community. There have been numerous studies that claim to autom...
Tim Weninger, Fabio Fumarola, Rick Barber, Jiawei ...
CEAS
2011
Springer
13 years 9 months ago
Spam detection using web page content: a new battleground
Traditional content-based e-mail spam filtering takes into account content of e-mail messages and apply machine learning techniques to infer patterns that discriminate spams from...
Marco Túlio Ribeiro, Pedro Henrique Calais ...
73
Voted
VLDB
2007
ACM
134views Database» more  VLDB 2007»
15 years 3 months ago
Building Structured Web Community Portals: A Top-Down, Compositional, and Incremental Approach
Structured community portals extract and integrate information from raw Web pages to present a unified view of entities and relationships in the community. In this paper we argue...
Pedro DeRose, Warren Shen, Fei Chen 0002, AnHai Do...