Sciweavers

ICSM
2009
IEEE

Modeling class cohesion as mixtures of latent topics

The paper proposes a new measure of class cohesion in object-oriented software systems. It is based on the analysis of latent topics embedded in the comments and identifiers of source code. The measure, named Maximal Weighted Entropy, uses the Latent Dirichlet Allocation technique and information entropy measures to quantitatively evaluate the cohesion of classes in software. The paper presents the principles and the technology that underlie the proposed measure. Two case studies on a large open source software system are presented. They compare the new measure with an extensive set of existing metrics and use these metrics to construct models that predict software faults. The case studies indicate that the new measure captures different aspects of class cohesion than the existing cohesion measures do, and that it improves fault prediction for most metrics when they are combined with Maximal Weighted Entropy.
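The abstract does not give the formula for Maximal Weighted Entropy, so the following is only a minimal sketch of the kind of information entropy computation such a measure could build on; it is not the authors' definition. It assumes that a topic model has already assigned one topic a weight in each method of a class. The function name `normalized_entropy` and the sample weights are hypothetical.

```python
import math


def normalized_entropy(weights):
    """Shannon entropy of a non-negative weight vector, scaled to [0, 1].

    A value near 1 means the weight is spread evenly over all entries;
    a value near 0 means it is concentrated in a single entry.
    """
    total = sum(weights)
    if total <= 0 or len(weights) < 2:
        return 0.0
    # Zero-weight entries contribute nothing to the entropy sum.
    probs = [w / total for w in weights if w > 0]
    entropy = -sum(p * math.log(p) for p in probs)
    # Divide by the maximum possible entropy for this many entries.
    return entropy / math.log(len(weights))


# Hypothetical weights of one topic across the four methods of a class.
evenly_shared = [0.25, 0.25, 0.25, 0.25]
concentrated = [0.97, 0.01, 0.01, 0.01]

print(normalized_entropy(evenly_shared))
print(normalized_entropy(concentrated))
```

Under this illustration, a topic shared evenly by all methods scores higher than a topic confined to one method, which matches the intuition that a cohesive class has methods that address the same concern.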
Added 24 May 2010
Updated 24 May 2010
Type Conference
Year 2009
Where ICSM
Authors Yixun Liu, Denys Poshyvanyk, Rudolf Ferenc, Tibor Gyimóthy, Nikos Chrisochoides