Sciweavers

ICWSM
2009

Content Based Recommendation and Summarization in the Blogosphere

13 years 2 months ago
Content Based Recommendation and Summarization in the Blogosphere
This paper presents a stochastic graph based method for recommending or selecting a small subset of blogs that best represents a much larger set. within a certain topic. Each blog is assigned a score that reflects how representative it is. Blog scores are calculated recursively in terms of the scores of their neighbors in a lexical similarity graph. A random walk is performed on a graph where nodes represent blogs and edges link lexically similar blogs. Lexical similarity is measured using either the cosine similarity measure, or the KullbackLeibler (KL) divergence. In addition, the presented method combines lexical centrality with information novelty to reduce redundancy in ranked blogs. Blogs similar to highly ranked blogs are discounted to make sure that diversity is maintained in the final rank. The presented method also allows to include additional initial quality priors to assess the quality of the blogs, such as frequency of new posts per day and the text fluency measured by n-...
Ahmed Hassan, Dragomir R. Radev, Junghoo Cho, Amru
Added 19 Feb 2011
Updated 19 Feb 2011
Type Journal
Year 2009
Where ICWSM
Authors Ahmed Hassan, Dragomir R. Radev, Junghoo Cho, Amruta Joshi
Comments (0)