A Memory Affinity Analysis of Scientific Applications on NUMA Platforms
Resumo
Understanding the underlying architecture is essential for scientific applications in general. An example of a computing environment is Non-Uniform Memory Access (NUMA) systems that enable a large amount of shared main memory. Nevertheless, NUMA systems can impose significant access latencies on data communications between distant memory nodes. Parallel applications with a naïve design may suffer significant performance penalties due to the lack of locality mechanisms. In this paper we present performance metrics on scientific applications to identify locality problems in NUMA systems and show data and thread mapping strategies to mitigate them. Our experiments were performed with four well-known scientific applications: CoMD, LBM, LULESH and Ondes3D. Experimental results demonstrate that scientific applications had significant locality problems and data and thread mapping strategies improved performance on all four applications.
Palavras-chave:
Measurement, Instruction sets, High performance computing, Conferences, Memory management, Data communication, Memory Affinity, NUMA, Scientific Applications
Publicado
26/10/2021
Como Citar
TRINDADE, Rafael Gauna; LIMA, João V. F.; CHARÃO, Andrea Schwertner.
A Memory Affinity Analysis of Scientific Applications on NUMA Platforms. In: WORKSHOP ON APPLICATIONS FOR MULTI-CORE ARCHITECTURES (WAMCA) - INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 33. , 2021, Belo Horizonte.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2021
.
p. 1-8.