Sciweavers

24 search results - page 2 / 5
» Web Page Cleaning for Web Mining through Feature Weighting
Sort
View
WWW
2008
ACM
14 years 5 months ago
Floatcascade learning for fast imbalanced web mining
This paper is concerned with the problem of Imbalanced Classification (IC) in web mining, which often arises on the web due to the "Matthew Effect". As web IC applicatio...
Xiaoxun Zhang, Xueying Wang, Honglei Guo, Zhili Gu...
WSDM
2010
ACM
215views Data Mining» more  WSDM 2010»
14 years 2 months ago
Boilerplate Detection using Shallow Text Features
In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...
Christian Kohlschütter, Peter Fankhauser, Wol...
SEBD
2008
177views Database» more  SEBD 2008»
13 years 6 months ago
Using PageRank in Feature Selection
Abstract. Feature selection is an important task in data mining because it allows to reduce the data dimensionality and eliminates the noisy variables. Traditionally, feature selec...
Dino Ienco, Rosa Meo, Marco Botta
KDD
2005
ACM
218views Data Mining» more  KDD 2005»
14 years 5 months ago
A maximum entropy web recommendation system: combining collaborative and content features
Web users display their preferences implicitly by navigating through a sequence of pages or by providing numeric ratings to some items. Web usage mining techniques are used to ext...
Xin Jin, Yanzan Zhou, Bamshad Mobasher
WWW
2004
ACM
14 years 5 months ago
Learning block importance models for web pages
Some previous works show that a web page can be partitioned to multiple segments or blocks, and usually the importance of those blocks in a page is not equivalent. Also, it is pro...
Ruihua Song, Haifeng Liu, Ji-Rong Wen, Wei-Ying Ma