Open Source Software (OSS) projects provide a unique opportunity to gather and analyze publicly available historical data. The Postgres SQL server, for example, has over seven years of recorded development and communication activity. We mined data from both the source code repository and the mailing list archives to examine the relationship between communication and development in Postgres. Along the way, we had to deal with the difficult challenge of resolving email aliases. We used a number of social network analysis measures and statistical techniques to analyze this data. We present our findings in this paper. Categories and Subject Descriptors D.2.8 [Software Engineering]: Metrics—Empirical, Open Source General Terms Human Factors, Measurement Keywords Open Source, Social Networks
Christian Bird, Alex Gourley, Premkumar T. Devanbu