Sciweavers

AIRWEB
2009
Springer

A study of link farm distribution and evolution using a time series of web snapshots

In this paper, we study the overall structure of link-based spam and its evolution, which should help in developing robust analysis tools and in studying Web spamming as a social activity in cyberspace. First, we use strongly connected component (SCC) decomposition to separate many link farms from the largest SCC, the so-called core. We show that denser link farms in the core can be extracted by node filtering and recursive application of SCC decomposition to the core. Surprisingly, we find new large link farms in each iteration, and this trend continues for at least 10 iterations. In addition, we measure the spamicity of such link farms. Next, we examine the evolution of link farms over two years. Results show that almost all large link farms no longer grow, while some of them shrink, and that many large link farms are created in one year.
Categories and Subject Descriptors H.3 [Information Storage and Retrieval]: Information Search and Retrieval General Te...
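The recursive procedure described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the `min_degree` filter (keep core nodes whose in-degree and out-degree inside the core reach a threshold), and the stopping rule are assumptions made for illustration, since the abstract does not specify how nodes are filtered. Each round reports the non-core SCCs with more than one node as link farm candidates, then repeats the decomposition on the filtered core.

```python
from collections import defaultdict


def strongly_connected_components(nodes, edges):
    """Iterative Kosaraju: return the SCCs of the induced subgraph as sets."""
    nodes = set(nodes)
    fwd, rev = defaultdict(list), defaultdict(list)
    for u, v in edges:
        if u in nodes and v in nodes:
            fwd[u].append(v)
            rev[v].append(u)

    # Pass 1: record nodes in order of DFS finishing time.
    order, seen = [], set()
    for root in nodes:
        if root in seen:
            continue
        seen.add(root)
        stack = [(root, iter(fwd[root]))]
        while stack:
            node, successors = stack[-1]
            for nxt in successors:
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append((nxt, iter(fwd[nxt])))
                    break
            else:
                order.append(node)  # all successors explored
                stack.pop()

    # Pass 2: DFS on the reversed graph in decreasing finishing time.
    comps, assigned = [], set()
    for root in reversed(order):
        if root in assigned:
            continue
        assigned.add(root)
        comp, stack = {root}, [root]
        while stack:
            node = stack.pop()
            for prev in rev[node]:
                if prev not in assigned:
                    assigned.add(prev)
                    comp.add(prev)
                    stack.append(prev)
        comps.append(comp)
    return comps


def peel_link_farm_candidates(nodes, edges, min_degree=2, max_rounds=10):
    """Return, for each round, the non-core SCCs that have more than one node."""
    nodes, edges = set(nodes), list(edges)
    rounds = []
    for _ in range(max_rounds):
        comps = strongly_connected_components(nodes, edges)
        if not comps:
            break
        core = max(comps, key=len)  # largest SCC, the "core"
        rounds.append([c for c in comps if c is not core and len(c) > 1])

        # Assumed filter: keep core nodes with enough links inside the core.
        indeg, outdeg = defaultdict(int), defaultdict(int)
        for u, v in edges:
            if u in core and v in core:
                outdeg[u] += 1
                indeg[v] += 1
        kept = {n for n in core
                if indeg[n] >= min_degree and outdeg[n] >= min_degree}
        if len(kept) < 2 or kept == nodes:
            break  # nothing left to split, or filtering changed nothing
        nodes = kept
    return rounds
```

In this sketch, a candidate found in a later round was hidden inside the core of an earlier round and becomes a separate SCC only after weakly linked nodes are removed. Measuring the spamicity of each candidate would be a separate step and is not covered here.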
Young-joo Chung, Masashi Toyoda, Masaru Kitsuregawa
Added 25 May 2010
Updated 25 May 2010
Type Conference
Year 2009
Where AIRWEB
Authors Young-joo Chung, Masashi Toyoda, Masaru Kitsuregawa