Voltar aos Detalhes do Artigo Evaluating Large Language Models through Multidimensional Item Response Theory: A Comprehensive Case Study on ENEM Baixar ##common.downloadPdf##