Online Multimedia Similarity Search with Response Time-Aware Parallelism and Task Granularity Auto-Tuning
Abstract
This paper presents an efficient parallel implementation of Product Quantization based approximate nearest neighbor search (PQANNS) indexing for multimedia similarity search. The parallel PQANNS answers nearest neighbor queries efficiently by exploiting the ability of the quantization approach to reduce data dimensionality (and memory demand) and by leveraging parallelism to speed up the search. Our solution is also optimized to minimize query response times under fluctuating query rates (load), as observed in online services. To achieve this goal, we have developed strategies that dynamically select, during execution, the parallelism configuration and task granularity that minimize query response times. The proposed strategies (ADAPT and ADAPT+G) were thoroughly evaluated and have been shown, for instance, to reduce query response times by 6.4× compared to the best static configuration of parallelism and task granularity.
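As an illustration of the product quantization idea the abstract refers to, the following is a minimal, sequential sketch in pure Python. It is not the authors' parallel implementation: the function names (`train_pq`, `encode`, `search`), the tiny k-means routine, and all parameters are assumptions made for this example. Each vector is split into `m` subvectors, each subvector is replaced by the index of its nearest centroid in a per-subspace codebook, and a query is compared against the compressed codes through per-subspace lookup tables. The vector dimension is assumed to be divisible by `m`.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, k, iters=10, seed=0):
    """Very small Lloyd's k-means; returns k centroids (illustrative only)."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: sq_dist(p, centroids[c]))
            clusters[nearest].append(p)
        for j, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return centroids

def split(vec, m):
    """Split a vector into m contiguous subvectors of equal length."""
    d = len(vec) // m
    return [vec[i * d:(i + 1) * d] for i in range(m)]

def train_pq(data, m, k):
    """Learn one codebook of k centroids for each of the m subspaces."""
    return [kmeans([split(v, m)[s] for v in data], k, seed=s) for s in range(m)]

def encode(vec, codebooks):
    """Compress a vector into m centroid indices, one per subspace."""
    subs = split(vec, len(codebooks))
    return [min(range(len(cb)), key=lambda c: sq_dist(sub, cb[c]))
            for sub, cb in zip(subs, codebooks)]

def search(query, codes, codebooks, topk=1):
    """Rank encoded vectors by approximate distance to an uncompressed query."""
    subs = split(query, len(codebooks))
    # Lookup tables: distance from each query subvector to every centroid.
    tables = [[sq_dist(sub, c) for c in cb] for sub, cb in zip(subs, codebooks)]
    scored = [(sum(t[c] for t, c in zip(tables, code)), i)
              for i, code in enumerate(codes)]
    return sorted(scored)[:topk]

# Usage: 50 random 8-dimensional vectors, 4 subspaces, 4 centroids each.
rng = random.Random(1)
data = [[rng.random() for _ in range(8)] for _ in range(50)]
codebooks = train_pq(data, m=4, k=4)
codes = [encode(v, codebooks) for v in data]
neighbors = search(data[0], codes, codebooks, topk=3)
```

In this sketch each 8-dimensional vector is stored as only four small integers, which is the source of the memory savings mentioned above. Distances are approximate because the database vectors are replaced by their quantized versions.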
Keywords:
Parallel processing, Quantization (signal), Time factors, Multimedia communication, Databases, Nearest neighbor searches, nearest neighbor search, product quantization, adaptive parallelism, auto-tuning, task-based runtime
Published
17/10/2017
How to Cite
ANDRADE, Guilherme; TEODORO, George; FERREIRA, Renato. Online Multimedia Similarity Search with Response Time-Aware Parallelism and Task Granularity Auto-Tuning. In: INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 29., 2017, Campinas/SP. Proceedings [...]. Porto Alegre: Sociedade Brasileira de Computação, 2017. p. 153-160.
