On the Interaction between Software Engineers and Data Scientists when Building Machine Learning-Enabled Systems

  • Gabriel Busquim PUC-Rio
  • Maria Julia Lima PUC-Rio
  • Marcos Kalinowski PUC-Rio

Abstract


Engineering ML-enabled systems presents both theoretical and practical challenges. A key challenge is effective interaction between actors with different backgrounds who must work closely together, such as software engineers and data scientists. This dissertation comprised three studies investigating the current collaboration dynamics between these two roles in ML projects. The studies revealed several challenges that can hinder collaboration between software engineers and data scientists, including differences in technical expertise and unclear definitions of each role’s duties. Potential solutions include encouraging team communication and producing concise system documentation.

Keywords: Machine Learning, ML-enabled System, Data Science, Software Engineering, Collaboration

References

Gabriel Busquim, Allysson Araújo, Maria Lima, and Marcos Kalinowski. 2024. Towards Effective Collaboration between Software Engineers and Data Scientists developing Machine Learning-Enabled Systems. In Anais do XXXVIII Simpósio Brasileiro de Engenharia de Software. SBC, 24–34.

Gabriel Busquim, Maria Julia Lima, and Marcos Kalinowski. 2024. On the Interaction between Software Engineers and Data Scientists when Building Machine Learning-Enabled Systems. Master’s thesis. [link]

Gabriel Busquim, Hugo Villamizar, Maria Julia Lima, and Marcos Kalinowski. 2024. On the Interaction between Software Engineers and Data Scientists when Building Machine Learning-Enabled Systems. In International Conference on Software Quality. Springer, 55–75.

Nadia Nahar, Shurui Zhou, Grace Lewis, and Christian Kästner. 2022. Collaboration Challenges in Building ML-Enabled Systems: Communication, Documentation, Engineering, and Process. In Proceedings of the 44th International Conference on Software Engineering. 413–425.

Published
2025-11-04
BUSQUIM, Gabriel; LIMA, Maria Julia; KALINOWSKI, Marcos. On the Interaction between Software Engineers and Data Scientists when Building Machine Learning-Enabled Systems. In: BRAZILIAN SOFTWARE QUALITY SYMPOSIUM (SBQS), 24., 2025, São José dos Campos/SP. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2025. p. 577. DOI: https://doi.org/10.5753/sbqs.2025.14987.