Dog Face Recognition Using Vision Transformer


The demand for effective, efficient and safe methods for animal identification has been increasing significantly, due to the need for traceability, management, and control of this population, which grows at higher rates than the human population, particularly pets. Motivated by the efficacy of modern human identification methods based on face biometrics features, in this paper, we propose a dog face recognition method based on vision transformers, a deep learning approach that decomposes the input image into a sequence of patches and applies self-attention to these patches to capture spatial relationships between them. Results obtained on DogFaceNet, a public database of dog face images, show that the proposed method, which uses the EfficientFormer-L1 architecture, outperforms the state-of-the-art method proposed previously in literature based on ResNet, a deep convolutional neural network.
CANTO, Victor Hugo Braguim; MANESCO, João Renato Ribeiro; SOUZA, Gustavo Botelho de; MARANA, Aparecido Nilceu. Dog Face Recognition Using Vision Transformer. In: BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 12. , 2023, Belo Horizonte/MG. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023 . p. 33-47. ISSN 2643-6264.