Memory Bandwidth: What can we improve?

  • Francis Birck Moreira UFPR
  • Marco Antônio Zanata Alves UFPR

Resumo


As the gap between processor performance and memory latency widens, the computer architecture research area is always out for new solutions for efficient memory access. In this paper, we analyze the maximum achievable gain by simulating an ideal DRAM memory that always services requests as row buffer hits, thus minimizing memory access latency and queuing. We reproduced the state-of-the-art ConGen2 technique from Natale et al. to measure how much improvement is still available. ConGen2 minimizes row buffer misses according to a min-k-cut solution based on all memory accesses of a target application. We observed that ConGen2 improves memory usage by 2.30% on average, while the ideal memory gains 36.88% on average for the chosen benchmarks. This gap leaves a wide margin of benefits to be gained.

Referências

Alves, M. A. Z. et al. (2015). Sinuca: A validated micro-architecture simulator. In 2015 IEEE 17th HPCC, pages 605–610. IEEE.

Mutlu, O. et al. (2020). A modern primer on processing in memory. arXiv preprint arXiv:2012.03112.

Natale, V. et al. (2020). Efficient generation of application specific memory controllers. In MEMSYS, pages 233–247.

Santos, P. C. et al. (2021). Survey on near-data processing: Applications and architectures. JICS, 16(2):1–17.
Publicado
18/04/2022
MOREIRA, Francis Birck; ALVES, Marco Antônio Zanata. Memory Bandwidth: What can we improve?. In: ESCOLA REGIONAL DE ALTO DESEMPENHO DA REGIÃO SUL (ERAD-RS), 22. , 2022, Curitiba. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2022 . p. 107-108. ISSN 2595-4164. DOI: https://doi.org/10.5753/eradrs.2022.19185.