Silva, João, Francisco de Assis Boldt, Luis A. Souza Jr, Mariella Berger, Anselmo Frizera, Alberto F. De Souza, Thiago Oliveira-Santos, and Claudine Badue. " Evaluating Transformer-Based Architectures for Simultaneous Audio Speech Transcription and Background Audio Captioning." Anais do LII Seminário Integrado de Software e Hardware, Maceió/AL, 2025. SBC, 2025, pp.633-644.