Sciweavers

Share
SIGIR
2010
ACM

Prototype hierarchy based clustering for the categorization and navigation of web collections

11 years 3 months ago
Prototype hierarchy based clustering for the categorization and navigation of web collections
This paper presents a novel prototype hierarchy based clustering (PHC) framework for the organization of web collections. It solves simultaneously the problem of categorizing web collections and interpreting the clustering results for navigation. By utilizing prototype hierarchies and the underlying topic structures of the collections, PHC is modeled as a multi-criterion optimization problem based on minimizing the hierarchy evolution, maximizing category cohesiveness and inter-hierarchy structural and semantic resemblance. The flexible design of metrics enables PHC to be a general framework for applications in various domains. In the experiments on categorizing 4 collections of distinct domains, PHC achieves 30% improvement in μF1 over the state-of-the-art techniques. Further experiments provide inn performance variations with abstract and concrete domains, completeness of the prototype hierarchy, and effects of different combinations of optimization criteria. Categories and Subje...
Zhaoyan Ming, Kai Wang, Tat-Seng Chua
Added 16 Aug 2010
Updated 16 Aug 2010
Type Conference
Year 2010
Where SIGIR
Authors Zhaoyan Ming, Kai Wang, Tat-Seng Chua
Comments (0)
books