Sciweavers

WWW
2002
ACM

Improvement of HITS-based algorithms on web documents

13 years 4 months ago
Improvement of HITS-based algorithms on web documents
In this paper, we present two ways to improve the precision of HITS-based algorithms on Web documents. First, by analyzing the limitations of current HITS-based algorithms, we propose a new weighted HITS-based method that assigns appropriate weights to in-links of root documents. Then, we combine content analysis with HITS-based algorithms and study the e ects of four representative relevance scoring methods, VSM, Okapi, TLS, and CDR, using a set of broad topic queries. Our experimental results show that our weighted HITS-based method performs signi cantly better than Bharat's improved HITS algorithm. When we combine our weighted HITS-based method or Bharat's HITS algorithm with any of the four relevance scoring methods, the combined methods are only marginally better than our weighted HITS-based method. Between the four relevancescoring methods, there is no signi cant quality di erence when they are combined with a HITS-based algorithm. Categories and Subject Descriptors H....
Longzhuang Li, Yi Shang, Wei Zhang
Added 23 Dec 2010
Updated 23 Dec 2010
Type Journal
Year 2002
Where WWW
Authors Longzhuang Li, Yi Shang, Wei Zhang
Comments (0)