Talent management by predicting employee attrition using enhanced weighted forest optimization algorithm with improved random forest classifier
S. Porkodi, S. Srihari and N. Vijayakumar
Abstract
Predictive analysis has been an important field of research suitable for a wide range of applications covering a huge volume of domains in predicting the future with the current and past data. In an organisation, the predicted insights are highly helpful in analysing all the aspects of an issue and making decisions suitably. More specifically, talent management requires making an appropriate decision in employing and maintaining suitable skills in the appropriate place. Machine learning algorithms are most commonly used in analysing the attributes that affect employee attrition and predicting employee turnover. This paper presents the prediction model that makes use of an enhanced weight-based forest optimization algorithm. It employs mutual information for selecting the significant features and a modified random forest for classifying the attrition results. The experimental analysis has been performed with the International Business Machines (IBM) human resource employee attrition dataset and the results are compared with the other existing models. The analysis shows that the proposed model offers better results with an accuracy of 91.23% and a minimum error rate of 8.77% than several other models. The feature significance helps in making effective steps in retaining the talents for the benefit of the organization.
Keyword
Talent management, Employee attrition, Predictive analytics, Machine learning, Random forest, Forest optimization algorithm.
Cite this article
Porkodi S, Srihari S, Vijayakumar N.Talent management by predicting employee attrition using enhanced weighted forest optimization algorithm with improved random forest classifier. International Journal of Advanced Technology and Engineering Exploration. 2022;9(90):563-582. DOI:10.19101/IJATEE.2021.875340
Refference
[1]Schweyer A. Talent management systems: best practices in technology solutions for recruitment, retention and workforce planning. John Wiley & Sons; 2004.
[2]Castellano WG. Practices for engaging the 21st century workforce: challenges of talent management in a changing workplace. FT Press; 2013.
[3]Lewis RE, Heckman RJ. Talent management: a critical review. Human Resource Management Review. 2006; 16(2):139-54.
[4]Dibble S. Keeping your valuable employees: retention strategies for your organizations most important resource. John Wiley & Sons; 1999.
[5]Abdurakhmanova G, Shayusupova N, Irmatova A, Rustamov D. The role of the digital economy in the development of the human capital market. The Research Archive. 2020; 24(7): 8043-51.
[6]Sisodia DS, Vishwakarma S, Pujahari A. Evaluation of machine learning models for employee churn prediction. In international conference on inventive computing and informatics 2017 (pp. 1016-20). IEEE.
[7]Maisuradze M. Predictive analysis on the example of employee turnover. Tallinn University of Technology. 2017.
[8]Nocker M, Sena V. Big data and human resources management: the rise of talent analytics. Social Sciences. 2019; 8(10):1-19.
[9]Karande S, Shyamala L. Prediction of employee turnover using ensemble learning. In ambient communications and computer systems 2019 (pp. 319-27). Springer, Singapore.
[10]Yadav S, Jain A, Singh D. Early prediction of employee attrition using data mining techniques. In international advance computing conference 2018 (pp. 349-54). IEEE.
[11]Poornappriya TS, Gopinath R. Employee attrition in human resource using machine learning techniques. Webology. 2021; 18(6):2844-56.
[12]Bama SS, Saravanan A. Efficient classification using average weighted pattern score with attribute rank based feature selection. International Journal of Intelligent Systems and Applications. 2019; 11(7):29-42.
[13]Khaire UM, Dhanalakshmi R. Stability of feature selection algorithm: a review. Journal of King Saud University-Computer and Information Sciences. 2022; 34(4):1060-73.
[14]Uthayakumar J, Metawa N, Shankar K, Lakshmanaprabu SK. Financial crisis prediction model using ant colony optimization. International Journal of Information Management. 2020; 50:538-56.
[15]Xiao L. Optimal allocation model of enterprise human resources based on particle swarm optimization. In international conference on computer information and big data applications 2020(pp. 249-53). IEEE.
[16]Sankhwar S, Gupta D, Ramya KC, Sheeba RS, Shankar K, Lakshmanaprabu SK. Improved grey wolf optimization-based feature subset selection with fuzzy neural classifier for financial crisis prediction. Soft Computing. 2020; 24(1):101-10.
[17]Moorthy U, Gandhi UD. Forest optimization algorithm‐based feature selection using classifier ensemble. Computational Intelligence. 2020; 36(4):1445-62.
[18]Razmjooy N, Sheykhahmad FR, Ghadimi N. A hybrid neural network–world cup optimization algorithm for melanoma detection. Open Medicine. 2018; 13(1):9-16.
[19]Pereira GT, Santos BZ, Cerri R. A genetic algorithm for transposable elements hierarchical classification rule induction. In congress on evolutionary computation 2018 (pp. 1-8). IEEE.
[20]Chiclana F, Kumar R, Mittal M, Khari M, Chatterjee JM, Baik SW. ARM–AMO: an efficient association rule mining algorithm based on animal migration optimization. Knowledge-Based Systems. 2018; 154:68-80.
[21]Kamath DR, Jamsandekar DS, Naik DP. Machine learning approach for employee attrition analysis. International Journal of Trend in Scientific Research and Development. 2019:62-7.
[22]Zhao Y, Hryniewicki MK, Cheng F, Fu B, Zhu X. Employee turnover prediction with machine learning: a reliable approach. In proceedings of SAI intelligent systems conference 2018 (pp. 737-58). Springer, Cham.
[23]Jain R, Nayyar A. Predicting employee attrition using xgboost machine learning approach. In international conference on system modeling & advancement in research trends 2018 (pp. 113-20). IEEE.
[24]Pratt M, Boudhane M, Cakula S. Employee attrition estimation using random forest algorithm. Baltic Journal of Modern Computing. 2021; 9(1):49-66.
[25]Salunkhe TP. Improving employee retention by predicting employee attrition using machine learning techniques (Doctoral dissertation, Dublin Business School). 2018.
[26]Bindra H, Sehgal K, Jain R. Optimisation of C5. 0 using association rules and prediction of employee attrition. In international conference on innovative computing and communications 2019 (pp. 21-9). Springer, Singapore.
[27]Sehgal K, Bindra H, Batra A, Jain R. Prediction of employee attrition using GWO and PSO optimised models of C5. 0 used with association rules and analysis of optimisers. In innovations in computer science and engineering 2019 (pp. 1-8). Springer, Singapore.
[28]Jain PK, Jain M, Pamula R. Explaining and predicting employees’ attrition: a machine learning approach. SN Applied Sciences. 2020; 2(4):1-11.
[29]Srivastava PR, Eachempati P. Intelligent employee retention system for attrition rate analysis and churn prediction: an ensemble machine learning and multi-criteria decision-making approach. Journal of Global Information Management. 2021; 29(6):1-29.
[30]Devi GD, Kamalakkannan S. Prediction of job satisfaction from the employee using ensemble method. In international conference on advanced computing technologies and applications 2022 (pp. 1-8). IEEE.
[31]Sharma MK, Singh D, Tyagi M, Saini A, Dhiman N, Garg R. Employee retention and attrition analysis: a novel approach on attrition prediction using fuzzy inference and ensemble machine learning. Webology. 2022; 19(2):5338-58.
[32]Rombaut E, Guerry MA. The effectiveness of employee retention through an uplift modeling approach. International Journal of Manpower. 2020; 41(8):1199-220.
[33]Wijaya D, DS JH, Barus S, Pasaribu B, Sirbu LI, Dharma A. Uplift modeling VS conventional predictive model: areliable machine learning model to solve employee turnover. International Journal of Artificial Intelligence Research. 2021; 5(1):53-64.
[34]Nagadevara V. Prediction of employee attrition using work-place related variables. 2012.
[35]Singh M, Varshney KR, Wang J, Mojsilovic A, Gill AR, Faur PI, Ezry R. An analytics approach for proactively combating voluntary attrition of employees. In international conference on data mining workshops 2012 (pp. 317-23). IEEE.
[36]Ghaemi M, Feizi-derakhshi MR. Forest optimization algorithm. Expert Systems with Applications. 2014; 41(15):6676-87.
[37]Ghaemi M, Feizi-derakhshi MR. Feature selection using forest optimization algorithm. Pattern Recognition. 2016; 60:121-9.
[38]Han J, Pei J, Kamber M. Data mining: concepts and techniques. Elsevier; 2011.
[39]Mohanty F, Rup S, Dash B, Majhi B, Swamy MN. Mammogram classification using contourlet features with forest optimization-based feature selection approach. Multimedia Tools and Applications. 2019; 78(10):12805-34.
[40]Huang CL, Wang CJ. A GA-based feature selection and parameters optimization for support vector machines. Expert Systems with Applications. 2006; 31(2):231-40.
[41]Zhao M, Fu C, Ji L, Tang K, Zhou M. Feature selection and parameter optimization for support vector machines: a new approach based on genetic algorithm with feature chromosomes. Expert Systems with Applications. 2011; 38(5):5197-204.
[42]Kachouie NN, Shutaywi M. Weighted mutual information for aggregated kernel clustering. Entropy. 2020; 22(3):1-15.
[43]Selvam RRLP, Saleem IAM, Alenezi A. Classification of imbalanced class distribution using random forest with multiple weight based majority voting for credit scoring. International Journal of Recent Technology and Engineering. 2019; 7(6S5):517-26.
[44]Zhang C, Wang X, Chen S, Li H, Wu X, Zhang X. A modified random forest based on kappa measure and binary artificial bee colony algorithm. IEEE Access. 2021; 9:117679-90.
[45]Birant KU. Multi‐view rank‐based random forest: a new algorithm for prediction in eSports. Expert Systems. 2022; 39(2).
[46]https://www.kaggle.com/pavansubhasht/ibm-hr-analytics-attrition-dataset. Accessed 26 November 2021.
[47]Bama SS, Ahmed MI, Saravanan A. A mathematical approach for mining web content outliers using term frequency ranking. Indian Journal of Science and Technology. 2015; 8(14): 1-5.
[48]García S, Luengo J, Herrera F. Data preprocessing in data mining. Cham, Switzerland: Springer International Publishing; 2015.
[49]Gopalsamy A, Radha B. Feature selection using multiple ranks with majority vote-based relative aggregate scoring model for Parkinson dataset. In proceedings of international conference on data science and applications 2022 (pp. 1-19). Springer, Singapore.
[50]Jo JM. Effectiveness of normalization pre-processing of big data to the machine learning performance. The Journal of the Korea Institute of Electronic Communication Sciences. 2019; 14(3):547-52.
[51]Bama SS, Ahmed MI, Saravanan A. A survey on performance evaluation measures for information retrieval system. International Research Journal of Engineering and Technology. 2015; 2(2):1015-20.
[52]Sathya BS, Ahmed I, Saravanan A. Relevance re-ranking through proximity based term frequency model. In international conference on ICT innovations 2016 (pp. 219-29). Springer, Cham.