Costa, Leandro, and João Baptista de Oliveira e Souza Filho. "LLM Agents for Search via Reinforcement Learning with Trajectory-Level Self-Evaluation." Anais do XXII Encontro Nacional de Inteligência Artificial e Computacional, Fortaleza/CE, 2025. SBC, 2025, pp. 1221-1232.