Experimental Evaluation of Quantization Methods in Facial Recognition

  • Carlos Henrique Monteiro UFMS
  • Evandro Raphaloski IFRJ
  • Edson Takashi Matsubara UFMS

Resumo


Quantization techniques are recognized for their effective in optimizing large language models. However, their application in smaller language models, such as those used for face recognition, is uncommon. In the field of facial recognition, these techniques can be utilized across a diverse range of settings, from API services in data centers to mobile application deployment. Smaller quantized models can reduce costs while maintaining a strong performance. However, their combination with facial recognition is uncommon. This study tested the impact of 8-bit quantization on embedding generation models for facial recognition. We compared two models: FaceNet (Inception-ResNet) and TransFace (Vision Transformer). We analyzed different precision formats (FP32 and INT8) and inference backends (Torch and ONNX) using datasets such as LFW, VGGFace2, and CelebA to evaluate the top-1 accuracy and cosine distance for similarity. The results show that FaceNet performs well under quantization, maintaining accuracy while reducing precision. In contrast, TransFace exhibited a significant decrease in performance. Quantizing embedding vectors can also reduce storage needs by up to 80% without much loss in performance. These findings support quantization as an effective strategy for optimizing models in resource-limited environments and enhancing facial recognition technology.
Palavras-chave: Data centers, Quantization (signal), Accuracy, Costs, Face recognition, Large language models, High performance computing, Transformers, Vectors, Mobile applications, FaceNet, TransFace, Quantization, LFW, VGGFace2, CelebA, Torch, ONNX
Publicado
28/10/2025
MONTEIRO, Carlos Henrique; RAPHALOSKI, Evandro; MATSUBARA, Edson Takashi. Experimental Evaluation of Quantization Methods in Facial Recognition. In: WORKSHOP ON LIGHTWEIGHT EFFICIENT DEEP LEARNING IN HPC ENVIRONMENTS (LEANDL-HPC) - INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 37. , 2025, Bonito/MS. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2025 . p. 108-115.