
MSR
2009
ACM

On mining data across software repositories

Software repositories provide an abundance of valuable information about open source projects. As the volume of data maintained by these repositories grows, automated extraction of data from individual repositories, and of linked information across repositories, has become a necessity. In this paper we describe a framework that uses web scraping to automatically mine repositories and link information across them. We discuss two implementations of the framework. In the first, we automatically identify and collect security problem reports from project repositories that deploy the Bugzilla bug tracker, using related vulnerability information from the National Vulnerability Database. In the second, we collect security problem reports for projects that deploy the Launchpad bug tracker, along with related vulnerability information from the National Vulnerability Database. We have evaluated our tool on various releases of Fedora, Ubuntu, Suse, RedHa...
Prasanth Anbalagan, Mladen A. Vouk
Added 23 Jul 2010
Updated 23 Jul 2010
Type Conference
Year 2009
Where MSR
Authors Prasanth Anbalagan, Mladen A. Vouk