CATTAI, Pedro; BALDASSIN, Alexandro; DANTAS, Allberson.
Inference Optimization for LLMs on CPUs: Analysis of the Current Landscape. In: REGIONAL SCHOOL OF HIGH PERFORMANCE COMPUTING FROM SÃO PAULO (ERAD-SP), 16., 2025, São José do Rio Preto/SP.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação, 2025.
p. 78-81.
DOI: https://doi.org/10.5753/eradsp.2025.9731.