Silva, J., Boldt, F., Souza Jr, L., Berger, M., Frizera, A., Souza, A., Oliveira-Santos, T., & Badue, C. (2025). Evaluating Transformer-Based Architectures for Simultaneous Audio Speech Transcription and Background Audio Captioning. In Proceedings of the 52nd Integrated Software and Hardware Seminar, (pp. 633-644). Porto Alegre: SBC. doi:10.5753/semish.2025.9474