Optimizing Image Identification: Discriminative Keypoint Detection with Equivariant CNN and SSIM-Based Triplet Loss

  • Wagner L. O. Santos UFF
  • Lizeth J. F. Perez UZH
  • Esteban W. G. Clua UFF
  • Renato Pajarola UZH
  • Anselmo A. Montenegro UFF

Abstract


Image identification is particularly challenging in datasets with high intra-class similarity and minimal structural variation, such as wood textures, security paper, or metal alloys. In these contexts, effective discrimination depends on capturing fine-grained textural cues. We address this with a novel keypoint detector built upon an Equivariant Convolutional Neural Network and trained with a triplet loss function guided by the Structural Similarity Index (SSIM). This design encourages the extraction of features that are not only equivariant to common transformations but also highly discriminative across visually similar instances. A central contribution of our method is the generation of keypoints with high repeatability, an attribute we show to be closely tied to improved identification accuracy. Through comprehensive experiments, we demonstrate that our approach consistently outperforms state-of-the-art methods in both matching and identification tasks across multiple datasets.
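The abstract does not spell out how SSIM enters the triplet loss, so the following is only a minimal sketch of one plausible reading: an SSIM-derived dissimilarity, 1 - SSIM, takes the place of the usual embedding distance in a standard margin-based triplet loss. The function names `global_ssim` and `ssim_triplet_loss`, the single-window (non-sliding) SSIM, the flattened-patch input format, and the margin value are all assumptions made for illustration and are not taken from the paper.

```python
def global_ssim(x, y, data_range=1.0, k1=0.01, k2=0.03):
    """Single-window SSIM between two flattened patches of equal length.

    Assumption: one global window instead of the usual sliding Gaussian
    window, which keeps the sketch short. Intensities lie in [0, data_range].
    """
    n = len(x)
    if n == 0 or n != len(y):
        raise ValueError("patches must be non-empty and of equal length")
    # Stabilizing constants from the standard SSIM definition.
    c1 = (k1 * data_range) ** 2
    c2 = (k2 * data_range) ** 2
    mu_x = sum(x) / n
    mu_y = sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / n
    var_y = sum((b - mu_y) ** 2 for b in y) / n
    cov_xy = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


def ssim_triplet_loss(anchor, positive, negative, margin=0.2):
    """Hinge triplet loss using 1 - SSIM as the dissimilarity (assumed form)."""
    d_pos = 1.0 - global_ssim(anchor, positive)  # should be small
    d_neg = 1.0 - global_ssim(anchor, negative)  # should be large
    return max(0.0, d_pos - d_neg + margin)


if __name__ == "__main__":
    patch = [0.1, 0.9, 0.3, 0.7, 0.5, 0.2, 0.8, 0.4]
    inverted = [1.0 - v for v in patch]  # structurally opposite patch
    print(ssim_triplet_loss(patch, patch, inverted))  # well-separated triplet
    print(ssim_triplet_loss(patch, inverted, patch))  # violated triplet
```

In this reading, the loss vanishes once the anchor is more similar to the positive than to the negative by at least the margin, and grows when the ordering is violated. A trainable version would need a differentiable SSIM over network outputs, which this sketch does not attempt.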
Keywords: Visualization, Accuracy, Feature detection, Semantics, Metals, Feature extraction, Robustness, Convolutional neural networks, Security, Indexes
Published
30/09/2025
SANTOS, Wagner L. O.; PEREZ, Lizeth J. F.; CLUA, Esteban W. G.; PAJAROLA, Renato; MONTENEGRO, Anselmo A. Optimizing Image Identification: Discriminative Keypoint Detection with Equivariant CNN and SSIM-Based Triplet Loss. In: CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 38., 2025, Salvador/BA. Proceedings [...]. Porto Alegre: Sociedade Brasileira de Computação, 2025. p. 74-79.