Profiler-Guided Execution of Recurrent OpenMP Task Graphs on Heterogeneous Clusters
Resumo
Distributed task-based execution models are well-suited for parallelizing irregular applications across clusters. OpenMP Cluster (OMPC) extends the traditional OpenMP tasking model to support distributed memory systems, leveraging a HEFT-based scheduler to improve resource utilization. However, the efficiency of such a scheduler depends heavily on accurate estimates of task execution and communication costs – information that is often difficult to obtain reliably and efficiently. To address this limitation, we propose a novel scheduling framework that combines the recent taskgraph directive introduced in OpenMP 6.0 with partial online profiling of iterative applications. Our approach performs quasi-static scheduling by recording task graphs at runtime and selectively profiling representative iterations to estimate performance. This information is interpolated and fed back into the scheduler to enhance decision-making. We demonstrate that our framework can improve scheduling quality with minimal overhead, making it suitable for long-running or repetitive workloads commonly found in High-Performance Computing (HPC) applications. We achieve up to 20% speedup for the total application and 4× speedup for scheduling.
Palavras-chave:
Runtime, Processor scheduling, High performance computing, Decision making, Parallel processing, Heterogeneous networks, Recording, Resource management, Reliability, Iterative methods, heterogeneous computing, high-performance computing, OpenMP, performance profiling, scheduling, task-based parallelism
Publicado
28/10/2025
Como Citar
NEVEU, Rémy; CECCATO, Rodrigo; MUNERA, Adrian; ROYUELA, Sara; DIAZ, Jose M. Monsalve; YVIQUEL, Hervé.
Profiler-Guided Execution of Recurrent OpenMP Task Graphs on Heterogeneous Clusters. In: INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 37. , 2025, Bonito/MS.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2025
.
p. 260-271.
