Sciweavers

AWIC
2003
Springer

Web Page Classification: A Soft Computing Approach

13 years 9 months ago
Web Page Classification: A Soft Computing Approach
The Internet makes it possible to share and manipulate a vast quantity of information efficiently and effectively, but the rapid and chaotic growth experienced by the Net has generated a poorly organized environment that hinders the sharing and mining of useful data. The need for meaningful web-page classification techniques is therefore becoming an urgent issue. This paper describes a novel approach to web-page classification based on a fuzzy representation of web pages. A doublet representation that associates a weight with each of the most representative words of the web document so as to characterize its relevance in the document. This weight is derived by taking advantage of the characteristics of HTML language. Then a fuzzy-rule-based classifier is generated from a supervised learning process that uses a genetic algorithm to search for the minimum fuzzy-rule set that best covers the training examples. The proposed system has been demonstrated with two significantly different clas...
Angela Ribeiro, Víctor Fresno, Maria C. Gar
Added 06 Jul 2010
Updated 06 Jul 2010
Type Conference
Year 2003
Where AWIC
Authors Angela Ribeiro, Víctor Fresno, Maria C. García-Alegre, Domingo Guinea
Comments (0)