Sciweavers

ICSE
2010
IEEE-ACM

Linking e-mails and source code artifacts

13 years 6 months ago
Linking e-mails and source code artifacts
E-mails concerning the development issues of a system constitute an important source of information about high-level design decisions, low-level implementation concerns, and the social structure of developers. Establishing links between e-mails and the software artifacts they discuss is a non-trivial problem, due to the inherently informal nature of human communication. Different approaches can be brought into play to tackle this traceability issue, but the question of how they can be evaluated remains unaddressed, as there is no recognized benchmark against which they can be compared. In this article we present such a benchmark, which we created through the manual inspection of a statistically significant number of e-mails pertaining to six unrelated software systems. We then use our benchmark to measure the effectiveness of a number of approaches, ranging from lightweight approaches based on regular expressions to full-fledged information retrieval approaches.
Alberto Bacchelli, Michele Lanza, Romain Robbes
Added 12 Oct 2010
Updated 12 Oct 2010
Type Conference
Year 2010
Where ICSE
Authors Alberto Bacchelli, Michele Lanza, Romain Robbes
Comments (0)