Sciweavers

JSS
2008

Examining the significance of high-level programming features in source code author classification

13 years 4 months ago
Examining the significance of high-level programming features in source code author classification
The use of Source Code Author Profiles (SCAP) represents a new, highly accurate approach to source code authorship identification that is, unlike previous methods, language independent. While accuracy is clearly a crucial requirement of any author identification method, in cases of litigation regarding authorship, plagiarism, and so on, there is also a need to know why it is claimed that a piece of code is written by a particular author. What is it about that piece of code that suggests a particular author? What features in the code make one author more likely than another? In this study, we describe a means of identifying the high-level features that contribute to source code authorship identification using as a tool the SCAP method. A variety of features are considered for Java and Common Lisp and the importance of each feature in determining authorship is measured through a sequence of experiments in which we remove one feature at a time. The results show that, for these programs, ...
Georgia Frantzeskou, Stephen G. MacDonell, Efstath
Added 13 Dec 2010
Updated 13 Dec 2010
Type Journal
Year 2008
Where JSS
Authors Georgia Frantzeskou, Stephen G. MacDonell, Efstathios Stamatatos, Stefanos Gritzalis
Comments (0)