Sciweavers

HICSS
2007
IEEE

Automatic Web Page Categorization using Principal Component Analysis

13 years 10 months ago
Automatic Web Page Categorization using Principal Component Analysis
Today’s search engines retrieve tens of thousands of web pages in response to fairly simple query articulations. These pages are retrieved on the basis of the query terms occurring in the web pages and the popularity of the web pages as per the link structure of the web. However, these search engines do not take into account the broader information need of the user, such as the task in which the user is involved. This research investigates the automatic categorization of web pages using Principal Component Analysis. The research focuses on user tasks that involve searching for web pages containing health information, education information or shopping information. Initial results are encouraging with recall and precision values slightly in excess of 80%.
Richong Zhang, Michael A. Shepherd, Jack Duffy, Ca
Added 02 Jun 2010
Updated 02 Jun 2010
Type Conference
Year 2007
Where HICSS
Authors Richong Zhang, Michael A. Shepherd, Jack Duffy, Carolyn R. Watters
Comments (0)