MH-AutoML: Transparência, Interpretabilidade e Desempenho na Detecção de Malware Android

Joner Assolin; Gabriel Canto; Diego Kreutz; Eduardo Feitosa

doi:10.5753/sbseg_estendido.2024.243362

Joner Assolin UFAM
Gabriel Canto UFAM
Diego Kreutz UNIPAMPA
Eduardo Feitosa UFAM

DOI: https://doi.org/10.5753/sbseg_estendido.2024.243362

Resumo

A MH-AutoML é uma ferramenta de AutoML especializada na detecção de malware Android. Diferentemente de outras ferramentas de AutoML, a MH-AutoML incorpora recursos de transparência, interpretabilidade e depuração em todos os estágios do pipeline. A ferramenta também inclui métodos de seleção de caracteŕısticas espećıficos para o domínio e otimizações de hiperparâmetros que geram bons resultados. Os resultados indicam que a MH-AutoML produz modelos preditivos competitivos (e.g., 95% de recall com baixo custo computacional) em comparação com modelos gerados por outras 7 ferramentas de AutoML.

Referências

Assolin, J. et. al. (2024). MH-AutoML. [link].

Erickson, N. et. al. (2020). Autogluon-tabular: Robust and accurate automl for structured data. arXiv preprint arXiv:2003.06505.

Guyon, I. et. al. (2015). Design of the 2015 chalearn automl challenge. In International Joint Conference on Neural Networks (IJCNN), pages 1–8.

Jin, H., Chollet, F., Song, Q., and Hu, X. (2023). Autokeras: An automl library for deep learning. Journal of machine Learning research, 24(6):1–6.

Karmaker S. et. al. (2021). Automl to date and beyond: Challenges and opportunities. ACM Computing Surveys, 54(8).

LeDell, E. and Poirier, S. (2020). H2o automl: Scalable automatic machine learning. In Proceedings of the AutoML Workshop at ICML, volume 2020.

Lee, D. J.-L. and Macke, S. (2020). A human-in-the-loop perspective on automl: Milestones and the road ahead. IEEE Data Engineering Bulletin.

Molino P. et. al. (2019). Ludwig: a type-based declarative deep learning toolbox.

Nasimian, A. et. al. (2024). Alphaml: A clear, legible, explainable, transparent, and elucidative binary classification platform for tabular data. Patterns, 5(1).

Olson, R. S. and Moore, J. H. (2016). TPOT: A tree-based pipeline optimization tool for automating machine learning. In Workshop on automatic machine learning.

Shi, X. et. al. (2021). An automated machine learning (automl) method of risk prediction for decision-making of autonomous vehicles. IEEE TITS, 22(11):7145.

Xin, D. et. al. (2021). Whither automl? understanding the role of automation in machine learning workflows. In Proceedings of the CHI.

Yan, C. et. al. (2022). Privacy-preserving online automl for domain-specific face detection. In IEEE CVF, pages 4134–4144.

Zimmer, L., Lindauer, M., and Hutter, F. (2000). Auto-pytorch tabular: Multi-fidelity metalearning for efficient and robust autodll. arxiv 2020. arXiv preprint arXiv:2006.13799.

MH-AutoML: Transparência, Interpretabilidade e Desempenho na Detecção de Malware Android

Resumo

Referências

Artigos mais lidos do(s) mesmo(s) autor(es)