T. Lira et al., "Aroeira: A Curated Corpus for the Portuguese Language with a Large Number of Tokens", in Anais da XXXIV Brazilian Conference on Intelligent Systems, Belém/PA, 2024, pp. 185-199.