Building a Dataset Related to Production and Marketing of Horticulture Products in Brazil
Abstract
This paper describes the process of building a dataset that gathers public data related to the production and marketing of horticulture and fruticulture products in Brazil, extracted from various sources using the Web Scraping process. To compose the initial version of the dataset, data was extracted from the 2010 Demographic Census, the Brazilian Institute of Geography and Statistics' (IBGE) Automatic Recovery System (SIDRA), and the National Supply Company (CONAB). Finally, a description of the extracted data and potential use cases is presented
References
Diouf, Rabiyatou et al. (2019) Web scraping: state-of-the-art and areas of application. In: IEEE International Conference on Big Data (Big Data). IEEE. p. 6040-6042.
Medeiros, A. M. A., Gonçalves, E. C. (2023) Estudo Comparativo de Estratégias para o Pareamento de Nomes de Entidades na Língua Portuguesa. In: Anais XVIII ERBD.
Meira, C. A. A. et al. (2002) Análise da produção brasileira de frutas a partir do armazém de dados da fruticultura. Campinas, SP: Embrapa. 6 p. Disponível em: [link]. Acesso em: jun/23
