Towards Automatic Soybean Cultivar Identification: SoyCult Dataset and Deep Learning Baselines

Eliezer Soares Flores; Marcelo Resende Thielo; Fabio Ronei Rodrigues Padilha

Eliezer Soares Flores UNIPAMPA
Marcelo Resende Thielo UNIPAMPA
Fabio Ronei Rodrigues Padilha Bagé, RS

Resumo

Soybean, the most economically important crop in Brazil, is projected to reach a production of 154 million metric tons in 2023, solidifying its position as the world's largest soybean producer and exporter. In this context, automatic soybean cultivar identification can play a crucial role in numerous applications, including ensuring seed homogeneity and preventing the presence of invasive seeds in crops. To contribute towards these advancements, this paper presents SoyCult, a publicly available dataset comprising seed images for accurate soybean cultivar identification. The dataset is created using a contour-based segmentation method proposed to extract individual seed images from captured input images, encompassing seeds from various cultivars commonly grown in Brazil. The performed experiments first assess a range of baseline systems, which combine a pre-trained convolutional network with a top-level classifier, representing a widely utilized deep learning-based strategy for classification tasks. Subsequently, selected baseline systems are subjected to additional evaluation using the Friedman test, followed by the Nemenyi post-hoc test. The SoyCult dataset, along with the source code developed to support this study, is publicly available online at https://github.com/eliezersflores/SoyCult.