Performance analysis of Cloud Computing storage services for checkpoint execution

Abstract


Cloud computing has been used to execute high-performance computing applications due to its potential to reduce costs. The instability of low-cost instances (Spot instances) can be mitigated with checkpointing, in which case the performance of the storage service becomes crucial to avoid excessively increasing the total runtime. We compared the performance of four Cloud Computing storage services on AWS (EBS, EFS, FSx for Lustre, and S3) for checkpoint persistence and observed that: (1) two of the four storage services showed scalability; (2) the choice of storage service can increase write performance by up to 727%.
Keywords: Cloud Computing, High Performance Computing, Spot, Storage

Published
2020-08-19
RODAMILANS, Charles B.; BORIN, Edson. Performance analysis of Cloud Computing storage services for checkpoint execution. In: REGIONAL SCHOOL OF HIGH PERFORMANCE COMPUTING FROM SÃO PAULO (ERAD-SP), 11., 2020, Online Event. Proceedings [...]. Porto Alegre: Sociedade Brasileira de Computação, 2020. p. 86-89. DOI: https://doi.org/10.5753/eradsp.2020.16893.
