Além da Conexão: Combinando Múltiplas Fontes de Dados para Entender e Prever Evasão de Internet Residencial

  • Vitor F. Zanotelli UFES
  • Wadham Bottacin UFES
  • Matheus S. De Martin UFES
  • Pedro de Morais UFES
  • Giovanni Comarela UFES
  • Rodolfo Villaça UFES
  • Vinícius F. S. Mota UFES
  • Antonio A. de A. Rocha UFF

Resumo


A retenção de usuários é uma preocupação crescente entre os provedores de acesso à Internet residencial. Nesse contexto, este trabalho explora dados internos de uma multinacional de telecomunicações e da Anatel para treinamento de modelos de aprendizado de máquina visando a previsão de evasão de seus clientes. Uma análise inicial mostra o desempenho desses modelos, quem alcançaram resultados de acurácia próxima aos 80%, e precisão e recall na faixa dos 70%. Também são identificadas as características mais influentes na decisão de saída de um cliente, viabilizando a implementação da abordagem proposta em estratégias para mitigação do problema de evasão.

Referências

Anatel (2022). Infographic: Overview of telecommunication in brazil - march 2022. [link]. [Online; acessado em janeiro de 2024].

Baeza-Yates, R. and Ribeiro-Neto, B. (2011). Modern Information Retrieval: The Concepts and Technology behind Search. Addison-Wesley Publishing Company, USA, 2nd edition.

Bertaglia, T. F. C. and Nunes, M. d. G. V. (2016). Exploring word embeddings for unsupervised textual user-generated content normalization. In Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT), pages 112–120.

Bhuse, P., Gandhi, A., Meswani, P., Muni, R., and Katre, N. (2020). Machine learning based telecom-customer churn prediction. In 2020 3rd International Conference on Intelligent Sustainable Systems (ICISS), pages 1297–1301.

Bilal, F., Syed, Almazroi, A., Abdulwahab, Bashir, Saba, Khan, H., Farhan, Almazroi, A., and Abdulaleem (2022). An ensemble based approach using a combination of clustering and classification algorithms to enhance customer churn prediction in telecom industry. PeerJ Computer Science, 8:e854.

Caigny, A. D., Coussement, K., Bock, K. W. D., and Lessmann, S. (2020). Incorporating textual information in customer churn prediction models based on a convolutional neural network. International Journal of Forecasting, 36(4):1563–1578.

Choudhari, A. S. and Potey, M. (2018). Predictive to prescriptive analysis for customer churn in telecom industry using hybrid data mining techniques. In 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA), pages 1–6.

Lalwani, P., Mishra, M.K., Chadha, and et al., J. (2022). Customer churn prediction system: a machine learning approach. Computing, 104(271–294).

Lu, N., Lin, H., Lu, J., and Zhang, G. (2014). A customer churn prediction model in telecom industry using boosting. IEEE Transactions on Industrial Informatics, 10(2):1659.

Lundberg, S. M. and Lee, S.-I. (2017). A unified approach to interpreting model predictions. Advances in neural information processing systems, 30.

Micci-Barreca, D. (2001). A preprocessing scheme for high-cardinality categorical attributes in classification and prediction problems. SIGKDD Explor. Newsl.

Pimentel, T. P. and Goldschmidt, R. R. (2019). Sequential sentiment pattern mining to predict churn in crm systems: A case study with telecom data. In Proceedings of the XV Brazilian Symposium on Information Systems (SBSI’19), pages Article 11, 1–8, New York, NY, USA. Association for Computing Machinery.

Slof, D., Frasincar, F., and Matsiiako, V. (2021). A competing risks model based on latent dirichlet allocation for predicting churn reasons. Decision Support Systems, 146:113541.

Souza, F., Nogueira, R., and Lotufo, R. (2020). BERTimbau: pretrained BERT models for Brazilian Portuguese. In 9th Brazilian Conference on Intelligent Systems, BRACIS, Rio Grande do Sul, Brazil, October 20-23 (to appear).

Stehani, S., Karunya, N., Ranjan, D. R. J. B., Sumathipala, S., and Sandanayake, T. C. (2020). Customer churn reasoning in telecommunication domain. In 2020 International Conference on Image Processing and Robotics (ICIP), pages 1–5.

Ullah, I., Raza, B., Malik, A. K., Imran, M., Islam, S. U., and Kim, S. W. (2019). A churn prediction model using random forest: Analysis of machine learning techniques for churn prediction and factor identification in telecom sector. IEEE Access, 7:60134–60149.

Wu, S., Yau, W. C., Ong, T. S., and Chong, S. C. (2021). Integrated churn prediction and customer segmentation framework for telco business. IEEE Access, 9:62118–62136.

Yucesan, M., Edwine, N., Wang, W., Song, W., and Ssebuggwawo, D. (2022). Detecting the risk of customer churn in telecom sector: A comparative study. Mathematical Problems in Engineering, 2022:8534739.

Zaki, M. J. and Jr, W. M. (2014). Data Mining and Analysis: Fundamental Concepts and Algorithms. Cambridge University Press, USA.

Óskarsdóttir, M., Bravo, C., Verbeke, W., Sarraute, C., Baesens, B., and Vanthienen, J. (2017). Social network analytics for churn prediction in telco: Model building, evaluation and network architecture. Expert Systems With Applications, 85:204–220.

Özköse, Y. E., Haznedaroğlu, A., and Arslan, L. M. (2021). Customer churn analysis with deep learning methods on unstructured data. In 2021 Innovations in Intelligent Systems and Applications Conference (ASYU), pages 1–5.
Publicado
20/05/2024
ZANOTELLI, Vitor F.; BOTTACIN, Wadham; MARTIN, Matheus S. De; MORAIS, Pedro de; COMARELA, Giovanni; VILLAÇA, Rodolfo; MOTA, Vinícius F. S.; ROCHA, Antonio A. de A.. Além da Conexão: Combinando Múltiplas Fontes de Dados para Entender e Prever Evasão de Internet Residencial. In: SIMPÓSIO BRASILEIRO DE REDES DE COMPUTADORES E SISTEMAS DISTRIBUÍDOS (SBRC), 42. , 2024, Niterói/RJ. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2024 . p. 323-336. ISSN 2177-9384. DOI: https://doi.org/10.5753/sbrc.2024.1366.

Artigos mais lidos do(s) mesmo(s) autor(es)

<< < 1 2 3