Towards Green Scientific Data Compression Through High-Level I/O Interfaces
Resumo
Every HPC system today has to cope with a deluge of data generated by scientific applications, simulations or large-scale experiments. The upscaling of supercomputer systems and infrastructures, generally results in a dramatic increase of their energy consumption. In this paper, we argue that techniques like data compression can lead to significant gains in terms of power efficiency by reducing both network and storage requirements. However, any data reduction is highly data specific and should comply with established requirements. Therefore, unsuitable or inappropriate compression strategy can utilize more resources and energy than necessary. To that end, we propose a novel methodology for achieving on-the-fly intelligent determination of energy efficient data reduction for a given data set by leveraging state-of-the-art compression algorithms and meta data at application-level I/O. We motivate our work by analyzing the energy and storage saving needs of data sets from real-life scientific HPC applications, and review the various lossless compression techniques that can be applied. We find that the resulting data reduction can decrease the data volume transferred and stored by as much as 80 % in some cases, consequently leading to significant savings in storage and networking costs.
Palavras-chave:
Energy consumption, Libraries, Data compression, Data models, Compression algorithms, Meteorology, Supercomputers, HPC, I/O interfaces, data reduction, compression, energy consumption
Publicado
24/09/2018
Como Citar
ALFOROV, Yevhen; LUDWIG, Thomas; NOVIKOVA, Anastasiia; KUHN, Michael; KUNKEL, Julian.
Towards Green Scientific Data Compression Through High-Level I/O Interfaces. In: INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 30. , 2018, Lyon/FR.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2018
.
p. 209-216.
