Evaluating Transformer-Based Architectures for Simultaneous Audio Speech Transcription and Background Audio Captioning