Detecção de ataques de phishing em tempo real utilizando algoritmos de aprendizado de máquina

João A. Souza; Dalbert M. Mascarenhas

doi:10.5753/sbseg_estendido.2023.234608

João A. Souza CEFET-RJ
Dalbert M. Mascarenhas CEFET-RJ

DOI: https://doi.org/10.5753/sbseg_estendido.2023.234608

Resumo

Phishing é um tipo de ataque cibernético de engenharia social que visa roubar informações de usuários. Com a pandemia de COVID-19 e popularização do modelo de trabalho remoto, o número de ataques cibernéticos aumentou, especialmente os de phishing. Embora diversas soluções anti-phishing existam, como blacklists e heurísticas, os ataques estão em constante adaptação. Este trabalho propõe um modelo baseado em aprendizado de máquina para detecção em tempo real de ataques de phishing com um enfoque no desempenho. Foram testados seis algoritmos dentre os quais o melhor resultado foi obtido pelo SVM com 99,19% de acurácia.

Referências

Berrar, D. et al. (2019). Cross-validation.

Khonji, M., Iraqi, Y., and Jones, A. (2013). Phishing detection: a literature survey. IEEE Communications Surveys & Tutorials, 15(4):2091–2121.

Nagunwa, T., Kearney, P., and Fouad, S. (2022). A machine learning approach for detecting fast flux phishing hostnames. Journal of Information Security and Applications, 65:103125.

Nielsen, J. (1994). Usability engineering. Morgan Kaufmann.

Ozimek, A. (2020). The future of remote work. Available at SSRN 3638597.

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12:2825–2830.

Pranggono, B. and Arabo, A. (2021). Covid-19 pandemic cybersecurity issues. Internet Technology Letters, 4(2):e247.

Rao, R. S. and Pais, A. R. (2019). Detection of phishing websites using an efficient feature-based machine learning framework. Neural Computing and Applications, 31:3851–3873.

Sahingoz, O. K., Buber, E., Demir, O., and Diri, B. (2019). Machine learning based phishing detection from urls. Expert Systems with Applications, 117:345–357.

Shahrivari, V., Darabi, M. M., and Izadi, M. (2020). Phishing detection using machine learning techniques. arXiv preprint arXiv:2009.11116.

Stalmans, E. and Irwin, B. (2011). A framework for dns based detection and mitigation of malware infections on a network. In 2011 Information Security for South Africa, pages 1–8. IEEE.

Zhang, Y., Hong, J. I., and Cranor, L. F. (2007). Cantina: a content-based approach to detecting phishing web sites. In Proceedings of the 16th international conference on World Wide Web, pages 639–648.