TY - GEN
T1 - Revisiting the applicability of the pareto principle to core development teams in open source software projects
AU - Yamashita, Kazuhiro
AU - McIntosh, Shane
AU - Kamei, Yasutaka
AU - Hassan, Ahmed E.
AU - Ubayashi, Naoyasu
PY - 2015/8/30
Y1 - 2015/8/30
N2 - It is often observed that the majority of the development work of an Open Source Software (OSS) project is contributed by a core team, i.e., a small subset of the pool of active developers. In fact, recent work has found that core development teams follow the Pareto principle-roughly 80% of the code contributions are produced by 20% of the active developers. However, those findings are based on samples of between one and nine studied systems. In this paper, we revisit prior studies about core developers using 2,496 projects hosted on GitHub. We find that even when we vary the heuristic for detecting core developers, and when we control for system size, team size, and project age: (1) the Pareto principle does not seem to apply for 40%-87% of GitHub projects; and (2) more than 88% of GitHub projects have fewer than 16 core developers. Moreover, we find that when we control for the quantity of contributions, bug fixing accounts for a similar proportion of the contributions of both core (18%-20%) and non-core developers (21%-22%). Our findings suggest that the Pareto principle is not compatible with the core teams of many GitHub projects. In fact, several of the studied GitHub projects are susceptible to the bus factor, where the impact of a core developer leaving would be quite harmful.
AB - It is often observed that the majority of the development work of an Open Source Software (OSS) project is contributed by a core team, i.e., a small subset of the pool of active developers. In fact, recent work has found that core development teams follow the Pareto principle-roughly 80% of the code contributions are produced by 20% of the active developers. However, those findings are based on samples of between one and nine studied systems. In this paper, we revisit prior studies about core developers using 2,496 projects hosted on GitHub. We find that even when we vary the heuristic for detecting core developers, and when we control for system size, team size, and project age: (1) the Pareto principle does not seem to apply for 40%-87% of GitHub projects; and (2) more than 88% of GitHub projects have fewer than 16 core developers. Moreover, we find that when we control for the quantity of contributions, bug fixing accounts for a similar proportion of the contributions of both core (18%-20%) and non-core developers (21%-22%). Our findings suggest that the Pareto principle is not compatible with the core teams of many GitHub projects. In fact, several of the studied GitHub projects are susceptible to the bus factor, where the impact of a core developer leaving would be quite harmful.
UR - http://www.scopus.com/inward/record.url?scp=84958664805&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84958664805&partnerID=8YFLogxK
U2 - 10.1145/2804360.2804366
DO - 10.1145/2804360.2804366
M3 - Conference contribution
AN - SCOPUS:84958664805
T3 - International Workshop on Principles of Software Evolution (IWPSE)
SP - 46
EP - 55
BT - 14th International Workshop on Principles of Software Evolution, IWPSE 2015 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 14th International Workshop on Principles of Software Evolution, IWPSE 2015
Y2 - 30 August 2015
ER -