Lovatto, Ã., Bueno, T., & Barros, L. 2021 nov 29. Gradient Estimation in Model-Based Reinforcement Learning: A Study on Linear Quadratic Environments. Anais da Brazilian Conference on Intelligent Systems (BRACIS). [Online] :