Detecting Inconsistencies in Public Bids: An Automated and Data-based Approach

  • Gabriel P. Oliveira UFMG
  • Arthur P. G. Reis UFMG
  • Felipe A. N. Freitas UFMG
  • Lucas L. Costa UFMG
  • Mariana O. Silva UFMG
  • Pedro P. V. Brum UFMG
  • Samuel E. L. Oliveira UFMG
  • Michele A. Brandão UFMG / IFMG
  • Anisio Lacerda UFMG
  • Gisele L. Pappa UFMG


One application for using government data is the detection of irregularities that may indicate fraud in the public sector. This paper presents an approach that analyzes public bidding data available on the Web to detect bidder inconsistencies. Specifically, we propose a hierarchical decision approach from public bidding data, where each bidder is classified as Valid, Doubtful, or Invalid, based on the compatibility between the bidding items and the divisions of the CNAE codes (National Classification of Economic activities). The results reveal that combining commonly available data on bidders and extracting the description of bid items can help in fraud detection. Furthermore, the proposed approach can reduce the number of bids a specialist must analyze to detect fraud, making it easier to identify inconsistencies.

Palavras-chave: Human-Computer Interaction, Assistive Technology, Accessibility, Autism spectrum disorders


Joao Paulo A Andrade 2021. A Machine Learning-based System for Financial Fraud Detection. In ENIAC (Online). SBC, Porto Alegre, Brasil, 165–176.

Carlos Assis, Adriano Pereira, Marconi Pereira, and Eduardo Carrano. 2013. Using genetic programming to detect fraud in electronic transactions. In WebMedia (Salvador, Brasil). ACM, New York, USA, 337–340.

Bart Baesens, Sebastiaan Höppner, and Tim Verdonck. 2021. Data engineering for fraud detection. Decision Support Systems 150 (2021), 113492.

Olivier Caelen. 2017. A Bayesian interpretation of the confusion matrix. Annals of Mathematics and Artificial Intelligence 81, 3(2017), 429–450.

Lucas L. Costa 2022. Alertas de fraude em licitações: Uma abordagem baseada em redes sociais. In BraSNAM (Niterói, Brasil). SBC, Porto Alegre, Brasil, 37–48.

Geanderson Esteves 2020. Understanding machine learning software defect predictions. Automated Software Engineering 27, 3 (2020), 369–392.

Rayelle Ingrid Vera Cruz Silva Muniz and Bernadette Farias Lóscio. 2018. Publicação de Dados Abertos Conectados Sobre os Transplantes Realizados no IMIP. In SBBD WTDBD (Rio de Janeiro, Brasil). SBC, Porto Alegre, Brasil. [link].

Ana Luiza Pedrosa Paschoal, Nálbia Araújo Santos, and Walmer Faroni. 2020. Diamante da fraude: evidências empíricas nos relatórios de demandas externas do Ministério da Transparência e Controladoria Geral da União (CGU) dos municípios brasileiros. Revista Ambiente Contábil 12, 2 (2020), 136–156.

Rinky D. Patel and Dheeraj Kumar Singh. 2013. Credit card fraud detection & prevention of fraud using genetic algorithm. Intl. Journal of Soft Comp. and Eng. 2, 6 (2013), 292–294.

G. Jaculine Priya and S. Saradha. 2021. Fraud Detection and Prevention Using Machine Learning Algorithms: A Review. In ICEES(Chennai, India). IEEE, 564–568.

Efrat Shimron 2022. Implicit data crimes: Machine learning bias arising from misuse of public data. the National Academy of Sciences 119, 13 (2022), e2117203119.

Glauco Vasconcelos Soares and Rodrigo Cunha. 2020. Predição de Irregularidade Fiscal dos Contribuintes do Tributo ISS. In SBBD (Online). SBC, Porto Alegre, Brasil, 223–228.

Adriano Veloso 2003. Efficient, Accurate and Privacy-Preserving Data Mining for Frequent Itemsets in Distributed Databases. In SBBD (Manaus, Brasil). SBC, Porto Alegre, Brasil, 281–292.

Allyson Vilela, André Almeida, and Frederico Lopes. 2018. OpenData Processor: An Automation tool for the process of extracting and publishing open data to CKAN. In WebMedia WFA (Salvador, Brasil). SBC, Porto Alegre, Brasil, 97–101.
OLIVEIRA, Gabriel P. et al. Detecting Inconsistencies in Public Bids: An Automated and Data-based Approach. In: SIMPÓSIO BRASILEIRO DE SISTEMAS MULTIMÍDIA E WEB (WEBMEDIA), 28. , 2022, Curitiba. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2022 . p. 193-201.


1 2 > >>