An Exploratory Study of Deep Learning for Predicting Computational Tasks Behavior in HPC Systems

  • Alexandre H. L. Porto LNCC
  • Micaella Coelho LNCC
  • Kary Ocaña LNCC
  • Carla Osthoff LNCC
  • Francieli Boito CNRS / LaBRI / University of Bordeaux / Bordeaux INP / INRIA
  • Douglas O. Cardoso Polytechnic Institute of Tomar

Resumo


The scientific gateway BioinfoPortal for bioinformatics applications is hosted in the National Laboratory for Scientific Computing (LNCC) and is coupled to the Santos Dumont (SDumont) supercomputer environment. BioinfoPortal offers a catalog of bioinformatics software that benefits from the parallel and distributed architecture offered by LNCC. Task submissions consume SDumont nodes shared by other users of the supercomputer; thus, it is important they use the best configuration, which is defined as the best choice of the number of threads/nodes to be allocated for every task submission. This article presents an analysis using neural networks to estimate the computational time required to execute bioinformatics software in several scenarios using a pre-configured number of nodes and threads. Our goal is to demonstrate the performance behavior of software such as RAxML in Bioinfoportal, and which computational scenario can be chosen to efficiently execute software in SDumont. Results support that the neural networks are adequate to predict the variable elapsed time, Elapsed, to evaluate the relationships between input parameters, number of bootstraps (RAxML), number of threads, and number of nodes, and to identify the fastest configuration. The goal is to make BioinfoPortal a smart, efficient, and green gateway. In future studies, we propose to study more variables and predictors as well as other bioinformatics software in BioinfoPortal.
Palavras-chave: neural networks, phylogenetic analysis, extra trees, performance prediction, performance modeling
Publicado
17/10/2023
PORTO, Alexandre H. L.; COELHO, Micaella; OCAÑA, Kary; OSTHOFF, Carla; BOITO, Francieli; CARDOSO, Douglas O.. An Exploratory Study of Deep Learning for Predicting Computational Tasks Behavior in HPC Systems. In: CHICKEN-EGG HPC/DL WORKSHOP - INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 35. , 2023, Porto Alegre/RS. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023 . p. 9-15.