Using Skip Graphs for Increased NUMA Locality

  • Samuel Thomas Davidson College
  • Roxana Hayne Davidson College
  • Jonad Pulaj Davidson College
  • Hammurabi Mendes Davidson College

Abstract


High-performance simulations and parallel frameworks often rely on highly scalable, concurrent data structures for system scalability. With an increased availability of NUMA architectures, we present a technique to promote NUMA-aware data parallelism inside a concurrent data structure, bringing significant quantitative and qualitative improvements on NUMA locality, as well as reduced contention for synchronized memory accesses. Our architecture is based on a data-partitioned, concurrent skip graph indexed by thread-local sequential maps. We implemented maps and relaxed priority queues using such technique. Maps show up to 6x higher CAS locality, up to a 68.6% reduction on the number of remote CAS operations, and an increase from 88.3% to 99% on the CAS success rate compared to a control implementation (subject to the same optimizations, and implementation practices). Remote memory accesses are not only reduced in number, but the larger the NUMA distance between threads, the larger the reduction is. Relaxed priority queues implemented using our technique show similar scalability improvements, with provable reduction in contention and decrease in relaxation in one of our implementations.
Keywords: High performance computing, Computer architecture, NUMA, concurrent data structures, skip graphs, locality
Published
2020-09-08
THOMAS, Samuel; HAYNE, Roxana; PULAJ, Jonad; MENDES, Hammurabi. Using Skip Graphs for Increased NUMA Locality. In: INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 32. , 2020, Porto/Portugal. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2020 . p. 157-166.