Sciweavers

160 search results - page 3 / 32
» Web page classification with heterogeneous data fusion
Sort
View
CIKM
2009
Springer
13 years 12 months ago
Improving web page classification by label-propagation over click graphs
In this paper, we present a semi-supervised learning method for web page classification, leveraging click logs to augment training data by propagating class labels to unlabeled si...
Soo-Min Kim, Patrick Pantel, Lei Duan, Scott Gaffn...
WIDM
2004
ACM
13 years 10 months ago
Stylistic and lexical co-training for web block classification
Many applications which use web data extract information from a limited number of regions on a web page. As such, web page division into blocks and the subsequent block classifica...
Chee How Lee, Min-Yen Kan, Sandra Lai
IJCAI
2003
13 years 6 months ago
Web Page Cleaning for Web Mining through Feature Weighting
Unlike conventional data or text, Web pages typically contain a large amount of information that is not part of the main contents of the pages, e.g., banner ads, navigation bars, ...
Lan Yi, Bing Liu
HICSS
2008
IEEE
175views Biometrics» more  HICSS 2008»
13 years 11 months ago
An Examination of Genre Attributes for Web Page Classification
In this paper, we describe a set of experiments to examine the effect of various attributes of web genre on the automatic identification of the genre of web pages. Four different ...
Lei Dong, Carolyn R. Watters, Jack Duffy, Michael ...
ICDM
2002
IEEE
143views Data Mining» more  ICDM 2002»
13 years 10 months ago
Automatic Web Page Classification in a Dynamic and Hierarchical Way
Automatic classification of web pages is an effective way to deal with the difficulty of retrieving information from the Internet. Although there are many automatic classification...
Xiaogang Peng, Ben Choi