Customer churn prediction model in enterprises using machine learning
Yamini B, K. Venkata Ramana, M. Nalini, D. Chitra Devi, B. Maheswari and Siva Subramanian. R
Abstract
The customer is a vital and integral component of all organizations worldwide. The success of any business hinges on its ability to engage customers effectively in terms of the products or services offered. Research into customer churn is a critical element in the operations of any enterprise. Customer churn prediction (CCP) helps to understand customer interactions with a business and often to identify when customers are likely to cease doing business with the company. In this study, Machine Learning (ML) algorithms are utilized for effective CCP. This study considers various supervised learning models, as the dataset used is labeled. The ML models employed include logistic regression (LR), k-nearest neighbors (k-NN), decision tree (DT), random forest (RF), XGBoost (XGB), light gradient boosting machine (LightGBM), and CatBoost. The ML model is applied to a dataset of customer churn sourced from the Kaggle repository. The results of the experiments are evaluated using several validity metrics, such as accuracy, recall, precision, area under the curve (AUC), and F1-score. The experimental data reveal that LR excels in terms of recall (0.5275) and accuracy (0.581), while the CatBoost model leads in AUC (0.8415), precision (0.6564), and F1-score (0.581). Moreover, LightGBM achieves results close to those of LR and CatBoost. The research findings indicate that the use of ML contributes to the prediction of customer churn. Additionally, the experimental results suggest that LR, CatBoost, and LightGBM outperform other ML models. Utilizing this knowledge, enterprises can develop more effective strategies for customer retention and enhance their business's financial performance.
Keyword
Customer churn, Enterprises, Prediction, ML, CatBoost, LightGBM, Logistic regression.
Cite this article
Yamini B, Ramana KV, Nalini M, Devi DC, Maheswari B, Subramanian. SR.Customer churn prediction model in enterprises using machine learning. International Journal of Advanced Technology and Engineering Exploration. 2024;11(110):94-107. DOI:10.19101/IJATEE.2023.10101581
Refference
[1]Mishra A, Reddy US. A comparative study of customer churn prediction in telecom industry using ensemble based classifiers. In international conference on inventive computing and informatics 2017 (pp. 721-5). IEEE.
[2]Qi J, Zhang Y, Zhang Y, Shi S. TreeLogit model for customer churn prediction. In Asia-Pacific conference on services computing 2006 (pp. 70-5). IEEE.
[3]Prabha D, Subramanian RS. A survey on customer relationship management. In 4th international conference on advanced computing and communication systems 2017 (pp. 1-5). IEEE.
[4]Hopkins E. Machine learning tools, algorithms, and techniques. Journal of Self-Governance and Management Economics. 2022; 10(1):43-55.
[5]Muneer A, Ali RF, Alghamdi A, Taib SM, Almaghthawi A, Ghaleb EA. Predicting customers churning in banking industry: a machine learning approach. Indonesian Journal of Electrical Engineering and Computer Science. 2022; 26(1):539-49.
[6]Sudharsan R, Ganesh EN. A swish RNN based customer churn prediction for the telecom industry with a novel feature selection strategy. Connection Science. 2022; 34(1):1855-76.
[7]Lalwani P, Mishra MK, Chadha JS, Sethi P. Customer churn prediction system: a machine learning approach. Computing. 2022:1-24.
[8]Ahmad AK, Jafar A, Aljoumaa K. Customer churn prediction in telecom using machine learning in big data platform. Journal of Big Data. 2019; 6(1):1-24.
[9]Vo NN, Liu S, Li X, Xu G. Leveraging unstructured call log data for customer churn prediction. Knowledge-Based Systems. 2021; 212:106586.
[10]Xiahou X, Harada Y. B2C E-commerce customer churn prediction based on K-means and SVM. Journal of Theoretical and Applied Electronic Commerce Research. 2022; 17(2):458-75.
[11]Zhang T, Moro S, Ramos RF. A data-driven approach to improve customer churn prediction based on telecom customer segmentation. Future Internet. 2022; 14(3):1-19.
[12]AL-najjar D, Al-rousan N, AL-najjar H. Machine learning to develop credit card customer churn prediction. Journal of Theoretical and Applied Electronic Commerce Research. 2022; 17(4):1529-42.
[13]Fujo SW, Subramanian S, Khder MA. Customer churn prediction in telecommunication industry using deep learning. Information Sciences Letters. 2022; 11(1):185-98.
[14]Rahman M, Kumar V. Machine learning based customer churn prediction in banking. In 4th international conference on electronics, communication and aerospace technology 2020 (pp. 1196-201). IEEE.
[15]Saheed YK, Hambali MA. Customer churn prediction in telecom sector with machine learning and information gain filter feature selection algorithms. In international conference on data analytics for business and industry 2021 (pp. 208-13). IEEE.
[16]Thorat MA, Sonawane VR. Customer churn prediction in the telecom industry using machine learning algorithms. Computer Integrated Manufacturing Systems. 2023; 29(4):1-11.
[17]Kumar S, Logofatu D. Comparative study on customer churn prediction by using machine learning techniques. In Asian conference on intelligent information and database systems 2023 (pp. 339-51). Cham: Springer Nature Switzerland.
[18]Awasthi S. Customer churn prediction on E-commerce data using stacking classifier. Authorea Preprints. 2023:1-10.
[19]Liu Y, Shengdong M, Jijian G, Nedjah N. Intelligent prediction of customer churn with a fused attentional deep learning model. Mathematics. 2022; 10(24):1-16.
[20]Jahan I, Sanam TF. An improved machine learning based customer churn prediction for insight and recommendation in E-commerce. In 25th international conference on computer and information technology 2022 (pp. 1-6). IEEE.
[21]Swetha P, Dayananda RB. A customer churn prediction model in telecom industry using improved_XGBoost. International Journal of Cloud Computing. 2023; 12(2-4):277-94.
[22]Pandithurai O, Ahmed HH, Sriman B, Seetha R. Telecom customer churn prediction using supervised machine learning techniques. In international conference on advances in computing, communication and applied informatics 2023 (pp. 1-7). IEEE.
[23]Angelina JJ, Subhashini SJ, Baba SH, Reddy PD, Reddy PS, Khan KS. A machine learning model for customer churn prediction using CatBoost classifier. In 7th international conference on intelligent computing and control systems 2023 (pp. 166-72). IEEE.
[24]Galal M, Rady S, Aref M. Enhancing customer churn prediction in digital banking using ensemble modeling. In 4th novel intelligent and leading emerging sciences conference 2022 (pp. 21-5). IEEE.
[25]Agarwal V, Taware S, Yadav SA, Gangodkar D, Rao AL, Srivastav VK. Customer-churn prediction using machine learning. In 2nd international conference on technological advancements in computational sciences 2022 (pp. 893-9). IEEE.
[26]Tsai TY, Lin CT, Prasad M. An intelligent customer churn prediction and response framework. In 14th international conference on intelligent systems and knowledge engineering 2019 (pp. 928-35). IEEE.
[27]Günesen SN, Şen N, Yıldırım N, Kaya T. Customer churn prediction in FMCG sector using machine learning applications. In IFIP international workshop on artificial intelligence for knowledge management 2021 (pp. 82-103). Cham: Springer International Publishing.
[28]Siva SR, Prabha D. Optimizıng naive bayes probability estimation in customer analysis using hybrid variable selection. In computer networks and inventive communication technologies: proceedings of third ICCNCT 2021 (pp. 595-612). Springer Singapore.
[29]Subramanian RS, Prabha D, Aswini J, Maheswari B, Anita M. Alleviating NB conditional independence using multi-stage variable selection (MSVS): banking customer dataset application. In journal of physics: conference series 2021 (pp. 1-10). IOP Publishing.
[30]Subramanian RS, Prabha D, Maheswari B, Aswini J. Customer analysis using machine learning algorithms: a case study using banking consumer dataset. Recent Trends in Intensive Computing. 2021; 689-94.
[31]Subramanian RS, Prabha D. Ensemble variable selection for naive bayes to improve customer behaviour analysis. Computer Systems Science & Engineering. 2022; 41(1):339-55.
[32]Al DE. Comparison between XGBoost, LightGBM and CatBoost using a home credit dataset. International Journal of Computer and Information Engineering. 2019; 13(1):6-10.
[33]Maheswari B, Bushra SN, Nirmala G, Anita M, Smys S, Kamel KA, et al. Enhancing customer prediction using machine learning with feature selection approaches. Inventive Computation and Information Technologies. Lecture Notes in Networks and Systems. 2023; 563.
[34]Prabha D, Subramanian RS, Balakrishnan S, Karpagam M. Performance evaluation of naive bayes classifier with and without filter based feature selection. International Journal of Innovative Technology and Exploring Engineering. 2019; 8(10):2154-8.