A Sequential Pattern Detection and Sentiment Analysis Combined Approach to the Churn Prediction Problem in Client Relationship Management Environments

  • Thiago P. Pimentel IME
  • Ronaldo R. Goldschmidt


The cost of losing profitable customers in competitive markets is driving companies to engage in customer retention. Therefore, anticipating client churn (i.e., cancellation) becomes essential. Among the researches on churn prediction models, we highlight those that are based on sequential pattern detection. Although promising, such initiatives do not take into account the sentiments present in the client’s interactions with the company. Given the above, this article proposes a method that generates churn prediction models from the combination of sequential pattern detection with sentiment extraction from the interactions with the clients. Experimental results confirm the adequacy of the proposed method.


A. Payne, P. F. (2005). A strategic framework for customer relationship management. pages 167––176. Journal of Marketing Research, 69.

Agrawal, R. and Srikant, R. (1994). Fast algorithms for mining association rules. pages 478–479. Proceedings of the 20th international conference on very large data bases.

Buckinx, W. and Van Den Poel, D. (2005). Customer base analysis: Partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting. volume 164, pages 252–268. European journal of operational research.

Burez, J. and Van den Poel, D. (2007). CRM at a pay-TV company: Using analytical models to reduce customer attrition by targeted marketing for subscription services. volume 32, pages 277–288. Expert Systems with Applications.

Chiang, D. A.,Wang, Y. F., Lee, S. L., and Lin, C. J. (2003). Goal-oriented sequential pattern for network banking churn analysis. volume 25, pages 293–302. Expert Systems with Applications.

Coussement, K. and De Bock, K. W. (2013). Customer churn prediction in the online gambling industry: The beneficial effect of ensemble learning. volume 66, pages 1629– 1636. Journal of Business Research.

Coussement, K. and Poel, D. V. d. (2009). Improving customer attrition prediction by integrating emotions from client/company interaction emails and evaluating multiple classifiers. volume 36, pages 6127–6134. Expert Systems with Applications.

Coussement, K. and Van den Poel, D. (2008). Churn prediction in subscription services: An application of support vector machines while comparing two parameter-selection techniques. volume 34, pages 313–327. Expert Systems with Applications.

De Bock, K. W., Coussement, K., and Van Den Poel, D. (2010). Computational Statistics and Data Analysis Ensemble classification based on generalized additive models. volume 54, pages 1535–1546. Computational Statistics and Data Analysis.

García, D. L., Nebot, , and Vellido, A. (2017). Intelligent data analysis approaches to churn as a business problem: a survey. volume 51, pages 719–774. Knowledge and Information Systems.

Goldschmidt, R. R. and Passos, E. (2005). Data Mining: Um guia prático. Campus, 2nd edition.

Hadden, J., Tiwari, A., Roy, R., and Ruta, D. (2007). Computer assisted customer churn management: State-of-the-art and future trends. volume 34, pages 2902–2917. Computers and Operations Research.

J. Pei, J. Han, B. M.-A. H. P. Q. C. U. D. C. H. (2001). PrefixSpan: mining sequential patterns efficiently by prefix-projected pattern growth. pages 215––224. IEEE Press.

J. Wang, J. H. (2004). BIDE: Efficient mining of frequent closed sequences. pages 79––90. IEEE Press.

Jenamani, M., Mohapatra, P. K., and Ghose, S. (2003). A stochastic model of e-customer behavior. volume 2, pages 81–94. Electronic Commerce Research and Applications, 1 edition.

Jonker, J. J., Piersma, N., and Van Den Poel, D. (2004). Joint optimization of customer segmentation and marketing policy to maximize long-term profitability. volume 27, pages 159–168. Expert Systems with Applications.

K. Faceli, A. C. Lorena, J. G. A. d. C. (2015). Inteligência Artificial. Uma Abordagem de Aprendizado de Máquina. LTC.

Kumar, D. A. and Ravi, V. (2008). Predicting credit card customer churn in banks using data mining. pages 4–28. International Journal of Information and Decision Sciences.

Larivière, B. and Van Den Poel, D. (2005). Predicting customer retention and profitability by using random forests and regression forests techniques. volume 29, pages 472–484. Expert Systems with Applications.

Liu, D.-R. and Shih, Y.-Y. (2005). Integrating AHP and data mining for product recommendation based on customer lifetime value. volume 42, pages 387–400. Information & Management.

R. Srikant, R. A. (1996). Mining sequential patterns: generalizations and performance improvements. pages 3––17. 5th international conference on extending database technology (EDBT).

Slotnick, S. A. and Sobel, M. J. (2005). Manufacturing lead-time rules: Customer retention versus tardiness costs. volume 163, pages 825–856. European Journal of Operational Research.

T. Mikolov,W. Y. and Zweig, G. (2013). Linguistic regularities in continuous space word representations. pages 746—-751. HLT-NAACL.

Verbeke, W., Dejaeger, K., Martens, D., Hur, J., and Baesens, B. (2012). New insights into churn prediction in the telecommunication sector: A profit driven data mining approach. volume 218, pages 211–229. European Journal of Operational Research.

W. Medhat, A. Hassan, H. K. (2014). . pages 1093–1113. Ain Shams Engineering Journal, v5.

Wu, D. (2009). Supplier selection: A hybrid model using DEA, decision tree and neural network. volume 36, pages 9105–9112. Expert Systems with Applications.

X. Yan, j. H. (2002). gspan: Graph-based substructure pattern mining. ICDM.

Zaki, M. (2001). Spade: an efficient algorithm for mining frequent sequences. pages 31––60. Mach Learn 42.

Zhang, X., Zhu, J., Xu, S., and Wan, Y. (2012). Predicting customer churn through interpersonal influence. volume 28, pages 94–104. Knowledge-Based Systems.

Como Citar

Selecione um Formato
PIMENTEL, Thiago P.; GOLDSCHMIDT, Ronaldo R.. A Sequential Pattern Detection and Sentiment Analysis Combined Approach to the Churn Prediction Problem in Client Relationship Management Environments. In: ENCONTRO NACIONAL DE INTELIGÊNCIA ARTIFICIAL E COMPUTACIONAL (ENIAC), 15. , 2018, São Paulo. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2018 . p. 704-715. ISSN 2763-9061. DOI: https://doi.org/10.5753/eniac.2018.4460.