Improving the Discovery and Clustering of Three-Dimensional Protein Patterns with OpenMP

  • Alejandro Valdés-Jiménez Universidad del Bío-Bío
  • Miguel Reyes-Parada Universidad de Santiago de Chile
  • Gabriel Nuñez-Vivanco University of Aysen
  • Fabio Durán-Verdugo Universidad de Talca
  • Daniel Jiménez-González Universitat Politecnica de Catalunya

Resumo


The discovery of conserved three-dimensional (3D) amino-acid patterns among a set of protein structures can be useful, for instance, to predict the functions of unknown proteins or for the rational design of multi-target drugs. There are several applications that perform a three-dimensional search of patterns in the structures of proteins. However, discovering conserved 3D patterns in a set of proteins with no other baseline patterns is a challenge. In this paper, we analyze and improve a state-of-the-art algorithm, 3D-PP, that implements this discovery. In this algorithm, the 3D patterns are detected and clustered using the root mean square deviation value, measured among each pair of 3D patterns (topological variability indicator). Even when 3D-PP deals with this task, the simultaneous processing of high amounts of proteins becomes a computational challenge with the size and the number of proteins to be evaluated. In this work, we present and analyze different shared memory parallel strategies of 3D-PP, using OpenMP. Those strategies improve the overall performance of the original implementation by reducing parallel load unbalance among threads and overall increasing parallelism. The results show significant performance improvements compared to the original version, achieving up to 13x speedup for a small number of proteins and 17.7× for a larger set.
Palavras-chave: OpenMP, performance optimization, three-dimensional protein patterns, drug-design
Publicado
17/10/2023
VALDÉS-JIMÉNEZ, Alejandro; REYES-PARADA, Miguel; NUÑEZ-VIVANCO, Gabriel; DURÁN-VERDUGO, Fabio; JIMÉNEZ-GONZÁLEZ, Daniel. Improving the Discovery and Clustering of Three-Dimensional Protein Patterns with OpenMP. In: INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 35. , 2023, Porto Alegre/RS. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023 . p. 202-208.