Sciweavers

MSR
2010
ACM

The Ultimate Debian Database: Consolidating bazaar metadata for Quality Assurance and data mining

13 years 9 months ago
The Ultimate Debian Database: Consolidating bazaar metadata for Quality Assurance and data mining
—FLOSS distributions like RedHat and Ubuntu require a lot more complex infrastructures than most other FLOSS projects. In the case of community-driven distributions like Debian, the development of such an infrastructure is often not very organized, leading to new data sources being added in an impromptu manner while hackers set up new services that gain acceptance in the community. Mixing and matching data is then harder than should be, albeit being badly needed for Quality Assurance and data mining. Massive refactoring and integration is not a viable solution either, due to the constraints imposed by the bazaar development model. This paper presents the Ultimate Debian Database (UDD),1 which is the countermeasure adopted by the Debian project to the above “data hell”. UDD gathers data from various data sources into a single, central SQL database, turning Quality Assurance needs that could not be easily implemented before into simple SQL queries. The paper also discusses the cust...
Lucas Nussbaum, Stefano Zacchiroli
Added 10 Jul 2010
Updated 10 Jul 2010
Type Conference
Year 2010
Where MSR
Authors Lucas Nussbaum, Stefano Zacchiroli
Comments (0)