Sciweavers

SAC
2006
ACM

Undue influence: eliminating the impact of link plagiarism on web search rankings

13 years 4 months ago
Undue influence: eliminating the impact of link plagiarism on web search rankings
Link farm spam and replicated pages can greatly deteriorate link-based ranking algorithms like HITS. In order to identify and neutralize link farm spam and replicated pages, we look for sufficient material copied from one page to another. In particular, we focus on the use of "complete hyperlinks" to distinguish link targets by the anchor text used. We build and analyze the bipartite graph of documents and their complete hyperlinks to find pages that share anchor text and link targets. Link farms and replicated pages are identified in this process, permitting the influence of problematic links to be reduced in a weighted adjacency matrix. Experiments and user evaluation show significant improvement in the quality of results produced using HITS-like methods.
Baoning Wu, Brian D. Davison
Added 15 Dec 2010
Updated 15 Dec 2010
Type Journal
Year 2006
Where SAC
Authors Baoning Wu, Brian D. Davison
Comments (0)