Rflow: Uma Abordagem de Reutilização de Workflows Estatísticos Legados

  • José Antônio Pires do Nascimento UFRRJ / Embrapa
  • Sérgio Manuel Serra da Cruz UFRRJ

Abstract


This paper presents an approach named RFlow, it aims to facilitate the management of statistical workflows and to mitigate limitations of statistical packages with regards to the management of provenance. RFlow is an approach that allows scientists to use legacy R scripts as meta-workflows enhancing their reuse, sharing, execution control with support of provenance metadata about the executions of such workflows.

References

Altintas, I. et al (2006), “Provenance Collection Support in the Kepler Scientific Workflow System”, Proc. of IPAW2006, 118-132.

Chambers, J. R (2008) Software Data Analysis Programming with R Software. Springer. 1st edition.

Crawley, M.J. (2002) Statistical Computing to Data Analysis using S-plus. Wiley. 1st edition.

Cruz, S. M. S. , Campos, M L M., Mattoso (2009) “Towards a Taxonomy of Provenance in Scientific Workflow Management Systems”,In: Services, .2009. pp. 259 – 266.

Higgins, D. (2007) ,“Using R in Kepler”, [link].

Kirchkamp, O. (2011) “Workflow of statistical data analysis”. http://www.kirchkamp.de/oekonometrie/pdf/wf-screen2.pdf.

Kumar, A., Wainer, J. (2005) “Meta-workflows as a control and coordination mechanism for exception handling in workflow systems”. Decision Support Systems. v. 40 pp. 89-105.

Ludäscher, B. et al. (2006) "Scientific workflow management and the Kepler system: Research Articles". Concurrency and Computation: Practice & Experience, v. 18, n. 10, pp. 1039-1065.

Mair, P., de Leeuw, J. (2010). “A general framework for multivariate analysis with optimal scaling: The R package aspect”. Journal of Statistical Software, 32(9), pp. 1-12.

Marinho et al (2012) “ProvManager: a provenance management system for scientific workflows” Conc. and Comp.: Practice & Experience. v. 24, n. 13, pp. 1513-1530.

Mattoso, M. et al., (2009), "Desafios no apoio à composição de experimentos científicos em larga escala". In: Seminário Integrado de Software e Hardware (XXXVI SEMISH), pp. 307-321.

Runnalls, A. (2013) “CXXR: an extensible R interpreter “In: Wiley Interdisciplinary Reviews: Computational Statistics. DOI: 10.1002/wics.1251.

Qin, Z., Xing, J., Zheng, X., (2008), Software architecture. Springer. 1st edition. Ranabahu, A., Anderson, P. Sheth, A. P. (2011) “The Cloud Agnostic e-Science Analysis Platform”. IEEE Internet Computing v. 15.pp. 85-89.

Silles, C. A., Runnalls, A. (2010) “Provenance-Awareness in R”. LNCS 6378. Springer, pp. 64-72.
Published
2013-07-23
NASCIMENTO, José Antônio Pires do; CRUZ, Sérgio Manuel Serra da. Rflow: Uma Abordagem de Reutilização de Workflows Estatísticos Legados. In: BRAZILIAN E-SCIENCE WORKSHOP (BRESCI), 7. , 2013, Maceió. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2013 . p. 1815-1822. ISSN 2763-8774.