Silva, J., Boldt, F., Souza Jr, L., Berger, M., Frizera, A., Souza, A., Oliveira-Santos, T., & Badue, C. (2025). Evaluating Transformer-Based Architectures for Simultaneous Audio Speech Transcription and Background Audio Captioning. In Anais do LII Seminário Integrado de Software e Hardware (pp. 633-644). Porto Alegre: SBC. doi:10.5753/semish.2025.9474