A Proposal for Sharing Software Process Provenance Data in Heterogeneous Environments

  • Gabriella Costa CEFET-MG
  • Eldânae Nogueira Teixeira UFRJ
  • Cláudia Werner UFRJ
  • Regina Braga UFJF


Software development practices have evolved, and new approaches such as Global Software Development (GSD) have emerged. In addition, software development companies have started to adopt data-driven practices in parts of their business. However, using and sharing software process data in a distributed and heterogeneous environment, such as a GSD context, can be challenging for many software engineers. In this paper, we present a proposal for sharing software process provenance data using PROV-SwProcess, a model that extends PROV. We present an example that applies this model to an industry process for error handling and the implementation of new features in an Enterprise Resource Planning system, and we explain how the model allows software process provenance data to be shared and supports inferences and insights about these data.


COSTA, Gabriella; TEIXEIRA, Eldânae Nogueira; WERNER, Cláudia; BRAGA, Regina. A Proposal for Sharing Software Process Provenance Data in Heterogeneous Environments. In: WORKSHOP DE PRÁTICAS DE CIÊNCIA ABERTA PARA ENGENHARIA DE SOFTWARE (OPENSCIENSE), 1., 2021, Joinville. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2021. p. 37-42. DOI: https://doi.org/10.5753/opensciense.2021.17144.