Inference of Code Familiarity by Mining Software Repositories
Abstract
The software development process is complex and, for this reason, it is common to execute it as a team. But teamwork can generate problems for an organization. One of these problems is the presence of portions of code known only by a single developer (or a small group). This fact can cause great difficulty in software maintenance. This paper presents a set of metrics and a tool called CoDiVision, that performs a data mining of code repositories to infer the familiarity that each developer has with the project. The CoDiVision was evaluated through the analysis of medium and large projects and the results indicate that the metrics and the tool can be an important support in software development.References
Beck, K., Beedle, M., Van Bennekum, A., Cockburn, A., Cunningham, W., Fowler, M., Grenning, J., Highsmith, J., Hunt, A., Jeffries, R., et al. (2001). Manifesto for agile software development.
Fritz, T., Murphy, G. C., Murphy-Hill, E., Ou, J., and Hill, E. (2014). Degree-ofknowledge: Modeling a developer’s knowledge of code. ACM Transactions on Software Engineering and Methodology (TOSEM), 23(2):14.
Hassan, A. E. (2008). The road ahead for mining software repositories. In Frontiers of Software Maintenance, 2008. FoSM 2008., pages 48–57. IEEE.
Mason, M. (2005). Pragmatic Version Control Using Subversion. Pragmatic Bookshelf.
Meng, X., Miller, B., Williams, W., and Bernat, A. (2013). Mining software repositories for accurate authorship. In Software Maintenance (ICSM), 2013 29th IEEE International Conference on, pages 250–259.
Moura, M. H. D. d., Nascimento, H. A. D. d., and Rosa, T. C. (2014). Extracting new metrics from version control system for the comparison of software developers. In Software Engineering (SBES), 2014 Brazilian Symposium on, pages 41–50. IEEE.
Sankoff, D. and Kruskal, J. B., editors (1983). Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison. Addison-Wesley.
Spinellis, D. (2005). Version control systems. Software, IEEE, 22(5):108–109.
Teles, V. M. (2004). Extreme programming. São Paulo: Novatec.
Torchiano, M., Ricca, F., and Marchetto, A. (2011). Is my project’s truck factor low?: Theoretical and empirical considerations about the truck factor threshold. In Proceedings of the 2Nd International Workshop on Emerging Trends in Software Metrics, WETSoM ’11, pages 12–18, New York, NY, USA. ACM.
Fritz, T., Murphy, G. C., Murphy-Hill, E., Ou, J., and Hill, E. (2014). Degree-ofknowledge: Modeling a developer’s knowledge of code. ACM Transactions on Software Engineering and Methodology (TOSEM), 23(2):14.
Hassan, A. E. (2008). The road ahead for mining software repositories. In Frontiers of Software Maintenance, 2008. FoSM 2008., pages 48–57. IEEE.
Mason, M. (2005). Pragmatic Version Control Using Subversion. Pragmatic Bookshelf.
Meng, X., Miller, B., Williams, W., and Bernat, A. (2013). Mining software repositories for accurate authorship. In Software Maintenance (ICSM), 2013 29th IEEE International Conference on, pages 250–259.
Moura, M. H. D. d., Nascimento, H. A. D. d., and Rosa, T. C. (2014). Extracting new metrics from version control system for the comparison of software developers. In Software Engineering (SBES), 2014 Brazilian Symposium on, pages 41–50. IEEE.
Sankoff, D. and Kruskal, J. B., editors (1983). Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison. Addison-Wesley.
Spinellis, D. (2005). Version control systems. Software, IEEE, 22(5):108–109.
Teles, V. M. (2004). Extreme programming. São Paulo: Novatec.
Torchiano, M., Ricca, F., and Marchetto, A. (2011). Is my project’s truck factor low?: Theoretical and empirical considerations about the truck factor threshold. In Proceedings of the 2Nd International Workshop on Emerging Trends in Software Metrics, WETSoM ’11, pages 12–18, New York, NY, USA. ACM.
Published
2017-08-28
How to Cite
IBIAPINA, Irvayne Matheus de Sousa; ALVES, Francisco Vanderson de Moura; LIRA, Werney Ayala Luz; SILVA, Gleison de Andrade e; DOS SANTOS NETO, Pedro de Alcântara.
Inference of Code Familiarity by Mining Software Repositories. In: BRAZILIAN SOFTWARE QUALITY SYMPOSIUM (SBQS), 16. , 2017, Rio de Janeiro.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2017
.
p. 104-118.
DOI: https://doi.org/10.5753/sbqs.2017.15095.
