Availability Modeling of Big Data Applications in Private Clouds
Abstract
The growing development of cloud services and an exponential increase in the volume of data produced has made Hadoop the standard platform for managing and processing data in Big Data environments. Ensuring the high availability of cloud services is an essential criterion. However, it becomes a complicated task, given the number of components in the system. Therefore, we propose a modeling strategy based on stochastic Petri nets (SPN) and reliability block diagrams (RBD) to assess the availability and reliability of cloud services, considering different redundant mechanisms. This article adopts sensitivity analysis to evaluate the component that most affects the availability of the Hadoop cluster configured on the OpenStack cloud platform.
Keywords:
Private Cloud, Availability, Cloud Computing, Reliability Block Diagram, Cloud Service
References
Bauer, E., Adams, R., and Eustace, D. (2012). Service Availability and Service Reliability, pages 20–33.
Bhathal, G. S. and Dhiman, A. S. (2018). Big data solution: Improvised distributions framework of hadoop. In 2018 Second International Conference on Intelligent Computing and Control Systems (ICICCS), pages 35–38.
Bhosale, V., Thakar, A., Pandit, C., Deshpande, A., and Khanuja, H. (2018). Hadoop in action: Building a generic log analyzing system. In 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA), pages 1–7.
Chang, W. and Grady, N. (2019). Nist big data interoperability framework: Volume 1, definitions.
Florin, G., Fraize, C., and Natkin, S. (1991). Stochastic petri nets: Properties, applications and tools. Microelectronics Reliability, 31(4):669-697.
Guedes, E. A. C. (2019). Availability and capacity modeling for virtual network functions based on redundancy and rejuvenation supported through live migration.
Liberman, J. (2015). Arquiteture overview - deploying openstack sahara on red hat enterprise linux openstack platform 6.
Marynowski, J., Santin, A., and Pimentel, A. (2015). Method for testing the fault tolerance of mapreduce frameworks. Computer Networks.
Mell, P. M. and Grance, T. (2011). Sp 800-145. the nist definition of cloud computing. Technical report, Gaithersburg, MD, USA.
Melo, C., Matos, R., Dantas, J., and Maciel, P. (2017). Capacity-oriented availability model for resources estimation on private cloud infrastructure. In 2017 IEEE 22nd Pacific Rim International Symposium on Dependable Computing (PRDC), pages 255–260.
O’Connor, P. D. T. (1998). An introduction to reliability and maintainability engineering, charles e. ebeling, mcgraw-hill, 1997. Quality and Reliability Engineering International, 14(4):295–295.
Sousa, E. T. G. D. (2015). Modelagem de desempenho, dependabilidade e custo para o planejamento de infraestruturas de nuvens privadas.
Trivedi, K. (2016). Statistical Inference. Probability and Statistics with Reliability, Queuing and Computer Science Applications, chapter 10, pages 661–752. John Wiley Sons, Ltd.
Trivedi, K., Matias, R., Maciel, P., and Kim, D. S. (2012). Dependability modeling. In Performance and Dependability in Service Computing: Concepts, Techniques and Research Directions, pages 53–97.
Bhathal, G. S. and Dhiman, A. S. (2018). Big data solution: Improvised distributions framework of hadoop. In 2018 Second International Conference on Intelligent Computing and Control Systems (ICICCS), pages 35–38.
Bhosale, V., Thakar, A., Pandit, C., Deshpande, A., and Khanuja, H. (2018). Hadoop in action: Building a generic log analyzing system. In 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA), pages 1–7.
Chang, W. and Grady, N. (2019). Nist big data interoperability framework: Volume 1, definitions.
Florin, G., Fraize, C., and Natkin, S. (1991). Stochastic petri nets: Properties, applications and tools. Microelectronics Reliability, 31(4):669-697.
Guedes, E. A. C. (2019). Availability and capacity modeling for virtual network functions based on redundancy and rejuvenation supported through live migration.
Liberman, J. (2015). Arquiteture overview - deploying openstack sahara on red hat enterprise linux openstack platform 6.
Marynowski, J., Santin, A., and Pimentel, A. (2015). Method for testing the fault tolerance of mapreduce frameworks. Computer Networks.
Mell, P. M. and Grance, T. (2011). Sp 800-145. the nist definition of cloud computing. Technical report, Gaithersburg, MD, USA.
Melo, C., Matos, R., Dantas, J., and Maciel, P. (2017). Capacity-oriented availability model for resources estimation on private cloud infrastructure. In 2017 IEEE 22nd Pacific Rim International Symposium on Dependable Computing (PRDC), pages 255–260.
O’Connor, P. D. T. (1998). An introduction to reliability and maintainability engineering, charles e. ebeling, mcgraw-hill, 1997. Quality and Reliability Engineering International, 14(4):295–295.
Sousa, E. T. G. D. (2015). Modelagem de desempenho, dependabilidade e custo para o planejamento de infraestruturas de nuvens privadas.
Trivedi, K. (2016). Statistical Inference. Probability and Statistics with Reliability, Queuing and Computer Science Applications, chapter 10, pages 661–752. John Wiley Sons, Ltd.
Trivedi, K., Matias, R., Maciel, P., and Kim, D. S. (2012). Dependability modeling. In Performance and Dependability in Service Computing: Concepts, Techniques and Research Directions, pages 53–97.
Published
2021-07-18
How to Cite
CUNHA, A. C.; CALLOU, G. R. A.; SOUSA, E. T. G.; TAVARES, E. A. G..
Availability Modeling of Big Data Applications in Private Clouds. In: WORKSHOP ON PERFORMANCE OF COMPUTER AND COMMUNICATION SYSTEMS (WPERFORMANCE), 20. , 2021, Evento Online.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2021
.
p. 49-60.
ISSN 2595-6167.
DOI: https://doi.org/10.5753/wperformance.2021.15722.
