Bromonschenkel, G., Oliveira, H., & Paixão, T. (2024). A Comparative Evaluation of Transformer-Based Vision Encoder-Decoder Models for Brazilian Portuguese Image Captioning. In Proceedings of the 37th Conference on Graphics, Patterns and Images. Porto Alegre: SBC.