A Jaccard Weights Kernel Leveraging Independent Thread Scheduling on GPUs
Resumo
Jaccard weights are a popular metric for identifying communities in social network analytics. In this paper we present a kernel for efficiently computing the Jaccard weight matrix on G PU s. The kernel design is guided by fine-grained parallelism and the independent thread scheduling supported by NVIDIA's Volta architecture. This technology makes it possible to interleave the execution of divergent branches for enhanced data reuse and a higher instruction per cycle rate for memory-bound algorithms. In a performance evaluation using a set of publicly available social networks, we report the kernel execution time and analyze the built-in hardware counters on different GPU architectures. The findings have implications beyond the specific algorithm and suggest a reformulation of other data-sparse algorithms.
Palavras-chave:
Kernel, Graphics processing units, Sparse matrices, Instruction sets, Memory management, Message systems
Publicado
24/09/2018
Como Citar
ANZT, Hartwig; DONGARRA, Jack.
A Jaccard Weights Kernel Leveraging Independent Thread Scheduling on GPUs. In: INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 30. , 2018, Lyon/FR.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2018
.
p. 229-232.
