Um método baseado em aprendizado de máquina para previsão da produção de refeições em restaurantes universitários
Resumo
Food waste from university restaurants is a common problem, generating a significant loss of food produced. Estimating the amount of food to be produced is a major challenge due to the behavior of students who frequent restaurants and the number of variables that must be observed. In order to make such estimates considering these variables and this behavior, methods based on Machine Learning (ML) can be explored for this purpose. Three different ML algorithms were tested and evaluated in this context: K-Nearest Neighbors, Random Forest, and Artificial Neural Networks using data from the restaurant of the Universidade Federal de Uberlândia. This study performed an analysis and pre-processing of the data, applying them to the ML algorithms, which were evaluated and compared with the human prediction and an algorithm that uses the average as an estimator. The results showed that in three of the four scenarios, the ML algorithms did better than the human prediction or an algorithm that does not use ML. In additional comparison, the human prediction did better than one of the four scenarios. We concluded that ML might be a reasonable solution in the future to reduce waste in college restaurants.
Referências
Gustavo E. A. P. A. Batista and Diego Furtado Silva. 2009. How k-nearest neighbor parameters affect its performance.
James Bergstra and Y. Bengio. 2012. Random Search for Hyper-Parameter Optimization. The Journal of Machine Learning Research 13 (03 2012), 281–305.
Tianfeng Chai and R. Draxler. 2014. Root mean square error (RMSE) or mean absolute error (MAE)?Geosci. Model Dev. 7 (01 2014). https://doi.org/10.5194/gmd-7-1247-2014
Leonildo Costa e Silva. 2019. Aprendizado de máquina com treinamento continuado aplicado à previsão de demanda de curto prazo: o caso do restaurante universitário da Universidade Federal de Uberlândia. Mestrado em Ciências. https://doi.org/10.14393/ufu.di.2019.2001
Adele Cutler, David Cutler, and John Stevens. 2011. Random Forests. Vol. 45. 157–176. https://doi.org/10.1007/978-1-4419-9326-7_5
Kevin Gurney. 1997. An Introduction to Neural Networks. Taylor& Francis, Inc., USA.
Jenny. Gustavsson, Christel Cederberg, Ulf Sonesson, Robert van Otterdijk, and Alexandre Meybeck. 2011. Global Food Losses and Food Waste- Extent, Causes and Prevention. (01 2011).
Sotiris Kotsiantis, Dimitris Kanellopoulos, and P. Pintelas. 2006. Data Preprocessing for Supervised Learning. International Journal of Computer Science 1 (01 2006), 111–117.
Tom M. Mitchell. 1997. Machine Learning. McGraw-Hill, New York.
Fatemeh Nargesian, Horst Samulowitz, Udayan Khurana, Elias Khalil, and Deepak Turaga. 2017. Learning Feature Engineering for Classification. 2529–2535. https://doi.org/10.24963/ijcai.2017/352
G S K Ranjan, Amar Verma, and Radhika Sudha. 2019. K-Nearest Neighbors and Grid Search CV Based Real Time Fault Monitoring System for Industries. 1–5. https://doi.org/10.1109/I2CT45611.2019.9033691
Rocha, José Celso e Matos, Felipe Delestro e Frei, Fernando. 2011. Utilização de redes neurais artificiais para a determinação do número de refeições diárias de um restaurante universitário. Revista de Nutrição 24 (10 2011), 735 – 742. https://doi.org/10.1590/S1415-52732011000500007
Jan van Rijn and Frank Hutter. 2018. Hyperparameter Importance Across Datasets. 2367–2376. https://doi.org/10.1145/3219819.3220058