Energy-Aware Scheduling for Serverless Scientific Workflows: A Machine Learning Approach

  • Lucas Rosa USP
  • Alfredo Goldman USP

Abstract


This article proposes to address the challenges of energy efficiency and workflow scheduling in serverless computing environments. By integrating machine learning techniques and simulation, the research aims to bridge gaps between energy efficiency and serverless scheduling. The methodology involves historical data collection, energy consumption prediction through machine learning, and the development of scheduling policies with deep neural networks. The project also includes adaptation of workflow management systems and validation in real-world environments, aiming to provide viable solutions to current challenges in HPC.

References

Antici, F., Yamamoto, K., Domke, J., and Kiziltan, Z. (2023). Augmenting ML-based Predictive Modelling with NLP to Forecast a Job’s Power Consumption. In Proceedings of the SC ’23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis, pages 1820–1830, Denver CO USA. ACM.

Elshamy, A., Alquraan, A., and Al-Kiswany, S. (2023). A Study of Orchestration Approaches for Scientific Workflows in Serverless Computing. In Proceedings of the 1st Workshop on SErverless Systems, Applications and MEthodologies, pages 34–40, Rome Italy. ACM.

Koslovski, G. P., Pereira, K., and Albuquerque, P. R. (2024). DAG-based workflows scheduling using Actor–Critic Deep Reinforcement Learning. Future Generation Computer Systems, 150:354–363.

Lehmann, F., Bader, J., Tschirpke, F., Thamsen, L., and Leser, U. (2023). How Workflow Engines Should Talk to Resource Managers: A Proposal for a Common Workflow Scheduling Interface. In 2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing (CCGrid), pages 166–179. arXiv:2302.07652 [cs].

Liao, X.-k., Lu, K., Yang, C.-q., Li, J.-w., Yuan, Y., Lai, M.-c., Huang, L.-b., Lu, P.-j., Fang, J.-b., Ren, J., and Shen, J. (2018). Moving from exascale to zettascale computing: challenges and techniques. Frontiers of Information Technology & Electronic Engineering, 19(10):1236–1244.

Majid, A. Y. and Marin, E. (2023). A Review of Deep Reinforcement Learning in Serverless Computing: Function Scheduling and Resource Auto-Scaling.

Roy, R. B., Patel, T., Gadepally, V., and Tiwari, D. (2022a). Mashup: making serverless computing useful for HPC workflows via hybrid execution. In Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 46–60, Seoul Republic of Korea. ACM.

Roy, R. B., Patel, T., and Tiwari, D. (2022b). DayDream: Executing Dynamic Scientific Workflows on Serverless Platforms with Hot Starts. In SC22: International Conference for High Performance Computing, Networking, Storage and Analysis, pages 1–18, Dallas, TX, USA. IEEE.

Su, L. and Naffziger, S. (2023). 1.1 Innovation For the Next Decade of Compute Efficiency. In 2023 IEEE International Solid-State Circuits Conference (ISSCC), pages 8–12, San Francisco, CA, USA. IEEE.

Zhang, D., Dai, D., He, Y., Bao, F. S., and Xie, B. (2020). RLScheduler: an automated HPC batch job scheduler using reinforcement learning. In Proceedings of the international conference for high performance computing, networking, storage and analysis, Sc ’20. IEEE Press.
Published
2024-05-16
ROSA, Lucas; GOLDMAN, Alfredo. Energy-Aware Scheduling for Serverless Scientific Workflows: A Machine Learning Approach. In: REGIONAL SCHOOL OF HIGH PERFORMANCE COMPUTING FROM SÃO PAULO (ERAD-SP), 15. , 2024, Rio Claro/SP. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2024 . p. 89-92. DOI: https://doi.org/10.5753/eradsp.2024.239934.

Most read articles by the same author(s)

1 2 3 4 > >>