L. Costa and J. Oliveira e Souza Filho.
"LLM Agents for Search via Reinforcement Learning with Trajectory-Level Self-Evaluation", in Anais do XXII Encontro Nacional de Inteligência Artificial e Computacional, Fortaleza/CE, 2025, pp. 1221-1232, doi: https://doi.org/10.5753/eniac.2025.14460.