Costa, L., & Oliveira e Souza Filho, J. 2025 set 29. LLM Agents for Search via Reinforcement Learning with Trajectory-Level Self-Evaluation. Anais do Encontro Nacional de InteligĂȘncia Artificial e Computacional (ENIAC). [Online] :