Experimental study on algorithmic fairness applied to credit analysis models

  • Tiago A. Oliveira Federal University of Bahia (UFBA)
  • João V. L. Oliveira Federal University of Bahia (UFBA)
  • Tarcísio P. Farias Federal Institute of Education, Science, and Technology of Bahia (IFBA)
  • Erick W. R. Cruz Federal Institute of Education, Science, and Technology of Bahia (IFBA)
  • Leandro J. S. Andrade Federal University of Bahia (UFBA)
  • Robespierre Pita Federal University of Bahia (UFBA)

Abstract


Machine Learning (ML) models for algorithmic decision-making are widely applied to support risk management and credit analysis. However, the significant increase in available data, the complexity of modern models, and public scrutiny surrounding artificial intelligence have intensified the debate on the need to identify and mitigate biases in predictions. This study aims to analyze the relationship between quantitative measures of algorithmic fairness and quality metrics obtained by ML models in credit analysis tasks. Initial results indicate that certain models can achieve promising performance levels without necessarily affecting or deteriorating fairness in their predictions.

Keywords: justiça algoritmica, machine learning, análise de crédito

References

Akkizidis, I. and Stagars, M. (2015). Marketplace lending, Financial Analysis, and the Future of credit: Integration, Profitability, and risk management. John Wiley & Sons.

Baesens, B., Van Gestel, T., Viaene, S., Stepanova, M., Suykens, J., and Vanthienen, J. (2003). Benchmarking state-of-the-art classification algorithms for credit scoring. JORS), 54(6):627–635.

Bellamy, R. K., Hind, M., Mojsilović, A., et al. (2019). Ai fairness 360: An extensible toolkit for detecting and mitigating algorithmic bias. IBM JRD, 63(4/5):4–1.

Bhatore, S., Mohan, L., and Reddy, Y. R. (2020). Machine learning techniques for credit risk evaluation: a systematic literature review. Journal of Banking and Financial Technology, 4(1):111–138.

Caton, S. and Haas, C. (2024). Fairness in machine learning: A survey. ACM Computing Surveys, 56(7):1–38.

Derelioğlu, G., Gürgen, F., and Okay, N. (2009). A neural approach for sme’s credit risk analysis in turkey. pages 749–759. Springer.

Dwork, C., Hardt, M., Pitassi, T., Reingold, O., and Zemel, R. (2012). Fairness through awareness. In Proceedings of the 3rd ITCS, pages 214–226.

Guidolin, M. and Pedio, M. (2021). Sharpening the accuracy of credit scoring models with machine learning algorithms. In Data Science for Economics and Finance: Methodologies and Applications, pages 89–115. Springer.

Gyamfi, N. K. and Abdulai, J.-D. (2018). Bank fraud detection using support vector machine. In 2018 IEEE 9th IEMCON, pages 37–41. IEEE.

Hardt, M., Price, E., Price, E., and Srebro, N. (2016). Equality of opportunity in supervised learning. In Lee, D., editor, NIPS, volume 29. Curran Associates, Inc.

He, Y., Burghardt, K., and Lerman, K. (2020). A geometric solution to fair representations. In Proceedings of the AAAI/ACM, AIES ’20, page 279–285, USA. ACM.

Hofmann, H. (1994). Statlog (german credit data).

Kasmi, M. L. (2021). Machine learning fairness in finance: An application to credit scoring. Diss., Tilburg University.

Kozodoi, N., Jacob, J., and Lessmann, S. (2022). Fairness in credit scoring: Assessment, implementation and profit implications. EJOR, 297(3):1083–1094.

Lai, K. K. and Zhou, L. (2006). Neural network metalearning for credit scoring. In International Conference on Intelligent Computing, pages 403–408. Springer.

Lessmann, S., Baesens, B., Seow, H.-V., and Thomas, L. C. (2015). Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research. EJOR, 247(1):124–136.

Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K., and Galstyan, A. (2021). A survey on bias and fairness in machine learning. ACM Computing Surveys, 54(6):1–35.

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Perrot, M., and Duchesnay, E. (2011). Scikit-learn: Machine learning in Python. JMLR, 12:2825–2830.

Ruback, L., Carvalho, D., and Avila, S. (2022). Mitigando vieses no aprendizado de máquina: Uma análise sociotécnica. iSys, 15(1):23–1.

Sadok, H., Sakka, F., and El Maknouzi, M. E. H. (2022). Artificial intelligence and bank credit analysis: A review. Cogent Economics, 10(1):2023262.

Sathye, M. et al. (2003). Credit analysis & lending management. Wiley.

Szwabe, A. and Misiorek, P. (2018). Decision trees as interpretable bank credit scoring models. In BDAS 2018, Poland, September 18-20, 2018, Proceedings 14, pages 207–219. Springer.

West, D., Dellana, S., and Qian, J. (2005). Neural network ensemble strategies for financial decision applications. COR, 32(10):2543–2559.

Xia, Y., Liu, C., Li, Y., and Liu, N. (2017). A boosted decision tree approach using bayesian hyper-parameter optimization for credit scoring. Expert systems with applications, 78:225–241.

Zhou, L., Lai, K. K., and Yu, L. (2010). Least squares support vector machines ensemble models for credit scoring. Expert systems with applications, 37(1):127–133.
Published
2024-10-14
OLIVEIRA, Tiago A.; OLIVEIRA, João V. L.; FARIAS, Tarcísio P.; CRUZ, Erick W. R.; ANDRADE, Leandro J. S.; PITA, Robespierre. Experimental study on algorithmic fairness applied to credit analysis models. In: WORKSHOP ON UNDERGRADUATE STUDENT WORK (WTAG) - BRAZILIAN SYMPOSIUM ON DATABASES (SBBD), 39. , 2024, Florianópolis/SC. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2024 . p. 29-36. DOI: https://doi.org/10.5753/sbbd_estendido.2024.243797.