Comparing Provenance Data Models for Scientific Workflows: an Analysis of PROV-Wf and ProvOne

  • Wellington Oliveira UFF / Newcastle University
  • Paolo Missier UFF
  • Daniel de Oliveira UFF
  • Vanessa Braganholo UFF

Abstract


Scientific workflows rely on provenance to be understandable, reproducible, and trustworthy. There is a growing demand for interoperability among provenance data generated by heterogeneous workflow management systems. To address this demand, several provenance models have been proposed that extend PROV to support the specific requirements of scientific workflows. In this paper, we present two prominent provenance models for scientific workflows, PROV-Wf and ProvOne, both of which are specializations of PROV, and compare their elements and relationships. Our goal is to provide an overview of each model and to help readers choose the one best suited to a specific context.


 

Published
04/07/2016
OLIVEIRA, Wellington; MISSIER, Paolo; DE OLIVEIRA, Daniel; BRAGANHOLO, Vanessa. Comparing Provenance Data Models for Scientific Workflows: an Analysis of PROV-Wf and ProvOne. In: BRAZILIAN E-SCIENCE WORKSHOP (BRESCI), 10., 2016, Porto Alegre. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2016. p. 237-244. ISSN 2763-8774. DOI: https://doi.org/10.5753/bresci.2016.9972.