Towards Optimizing Computational Costs of Federated Learning in Clouds

  • Rafaela Brum UFF
  • Lúcia Drummond UFF
  • Maria Clicia Castro UERJ
  • George Teodoro UFMG

Resumo


Federated Learning is a strategy where distributed training datasets are processed by several clients coordinated by a central server that keeps the global learning model. This approach is very attractive in the biomedical domain because it allows the use of private datasets from multiple institutions to train a model without the need of sharing the data. Moreover, Machine Learning approaches often require larger training samples than what can be afforded by a single institution. In this work, we are interested in analyzing the performance of a Tumor-Infiltrating Lymphocytes Classification problem when solved by a federated learning approach deployed in a commercial cloud. In the presented evaluation, we consider executions of a federated learning implementation on accelerated instance types of the AWS EC2, on on-demand and spot markets, while varying the number of clients. The obtained results showed the improvement of the accuracy and execution times when the number of clients increases. They also revealed that, although the spot instances suffered from revocations, their use could significantly reduce the financial costs compared to the on-demand one.
Palavras-chave: Training, Costs, Biological system modeling, High performance computing, Conferences, Machine learning, Computer architecture, Cloud Computing, Federated Learning, GPU instances
Publicado
26/10/2021
BRUM, Rafaela; DRUMMOND, Lúcia; CASTRO, Maria Clicia; TEODORO, George. Towards Optimizing Computational Costs of Federated Learning in Clouds. In: WORKSHOP ON CLOUD COMPUTING - INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 33. , 2021, Belo Horizonte. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2021 . p. 35-40.