A Comparative Study of Algorithms for Estimating Truck Factor

  • Mívian Ferreira UFMG
  • Guilherme Avelino UFMG
  • Marco Tulio Valente UFMG
  • Kecia A. M. Ferreira CEFET-MG

Abstract


In modern software projects, it is crucial to have reliable data about how knowledge of the source code is distributed among team members. This information can help, for example, to avoid "islands of knowledge" and to mitigate the risks associated with the loss of key developers. Truck factor is a key measure proposed to estimate such risks. Basically, truck factor (also known as bus factor) designates the minimal number of developers that have to be hit by a truck (or quit) before a project is incapacitated. Although truck factor is a key measure of the concentration of information among team members, few algorithms have been proposed to estimate it. More importantly, rigorous comparisons of the existing algorithms are lacking. Therefore, in this paper we provide a comparative study of the two main algorithms proposed in the literature to estimate truck factors. For this purpose, we rely on a large dataset of 133 popular GitHub systems. We compare both the performance of these algorithms and the truck factors they estimate.
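To make the definition above concrete, the following is a minimal sketch of one possible way to estimate a truck factor. It is not necessarily either of the two algorithms compared in the paper. It assumes a hypothetical mapping from files to their authors, a greedy heuristic that repeatedly removes the developer who authors the most files, and a hypothetical rule that the project is "incapacitated" once more than 50% of its files have no remaining author. The function name, the input format, and the threshold are all assumptions made for illustration.

```python
from collections import defaultdict


def estimate_truck_factor(authors_by_file, orphan_threshold=0.5):
    """Greedy truck factor sketch (illustrative assumptions only).

    authors_by_file: dict mapping a file path to the set of its authors.
    orphan_threshold: assumed fraction of author-less files above which
        the project counts as incapacitated.
    Returns (truck_factor, removed_developers).
    """
    remaining = {path: set(authors) for path, authors in authors_by_file.items()}
    total = len(remaining)
    removed = []
    if total == 0:
        return 0, removed

    while True:
        # Count files that no longer have any author.
        orphaned = sum(1 for authors in remaining.values() if not authors)
        if orphaned / total > orphan_threshold:
            break

        # Count how many files each remaining developer authors.
        counts = defaultdict(int)
        for authors in remaining.values():
            for author in authors:
                counts[author] += 1
        if not counts:
            break  # nobody left to remove

        # Remove the developer with the most files (ties broken by name).
        top = min(counts, key=lambda a: (-counts[a], a))
        removed.append(top)
        for authors in remaining.values():
            authors.discard(top)

    return len(removed), removed


# Hypothetical toy project with three developers.
sample = {
    "core.py": {"ana"},
    "db.py": {"ana"},
    "api.py": {"ana", "bo"},
    "ui.py": {"bo"},
    "docs.md": {"cy"},
}
print(estimate_truck_factor(sample))  # -> (2, ['ana', 'bo'])
```

In this toy input, removing "ana" orphans two of five files (40%), which is not enough under the assumed threshold. Removing "bo" as well orphans four of five files (80%), so the estimated truck factor is 2.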
Keywords: Software, Computer architecture, Direction-of-arrival estimation, Software algorithms, Electronic mail, Java, Education, truck factor, code authorship, GitHub
Published
19/09/2016
FERREIRA, Mívian; AVELINO, Guilherme; VALENTE, Marco Tulio; FERREIRA, Kecia A. M. A Comparative Study of Algorithms for Estimating Truck Factor. In: SIMPÓSIO BRASILEIRO DE COMPONENTES, ARQUITETURAS E REUTILIZAÇÃO DE SOFTWARE (SBCARS), 10., 2016, Maringá/PR. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2016. p. 91-100.