
SCAM
2007
IEEE

A Framework for Studying Clones In Large Software Systems

Clones are code segments that have been created by copying and pasting from other code segments. Clones occur often in large software systems: it is reported that 5 to 50% of the source code of a large software system is cloned. A major challenge when studying code cloning in large software systems is handling the large number of clone candidates produced by leading-edge clone detection tools. For example, the CCFinder clone detection tool produces over 7 million pairs of clone candidates for the Linux kernel (which consists of over 4 MLOC). Moreover, the output of clone detection tools grows rapidly as a software system evolves. Researchers and developers need tools to help them study this large amount of clone data in order to better understand cloning phenomena in large systems. In this paper, we propose a data mining framework to help researchers cope with the large amount of data produced by clone detection tools. We use techniques to reduce, abstract and highlight the most intere...
Zhen Ming Jiang, Ahmed E. Hassan
Added 04 Jun 2010
Updated 04 Jun 2010
Type Conference
Year 2007
Where SCAM