Evaluation of an OPENMP Parallelization of Lucas-Kanade on a NUMA-Manycore
Resumo
Lucas-Kanade algorithm is a well-known optical flow estimator widely used in image processing for motion detection and object tracking. As a typical image processing algorithm, the procedure is a series of convolution masks followed by 2×2 linear systems for the optical flow vectors. Since we are dealing with a stencil computation for each stage of the algorithm, the overhead from memory accesses is expected to stand as a serious scalability bottleneck, especially on a NUMA manycore configuration. The objective of this study is therefore to investigate an openMP parallelization of Lucas-kanade algorithm on a NUMA manycore, including the performance impact of NUMA-aware settings at runtime. Experimental results on a dual-socket INTEL Broadwell-EIEP is provided together with the corresponding technical discussions.
Palavras-chave:
Optical imaging, Optical distortion, Optical sensors, Estimation, Graphics processing units, Scalability, Hardware, Optical flow, Lucas-Kanade, multicore, manycore, openMP NUMA, scalability
Publicado
24/09/2018
Como Citar
HAGGUI, Olfa; TADONKI, Claude; SAYADI, Fatma; OUNI, Bouraoui.
Evaluation of an OPENMP Parallelization of Lucas-Kanade on a NUMA-Manycore. In: INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 30. , 2018, Lyon/FR.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2018
.
p. 436-441.
