International Journal of Advanced Technology and Engineering Exploration (IJATEE) ISSN (P): 2394-5443 ISSN (O): 2394-7454 Vol - 8, Issue - 84, November 2021
  1. 1
    Google Scholar
A systematic literature review on student performance predictions

Hasnah Nawang, Mokhairi Makhtar and Wan Mohd Amir Fazamin Wan Hamza

Abstract

Prediction of student performance in educational institutions is a major topic of debate among researchers in efforts to improve teaching and learning. Effective prediction techniques and features would help educators and teachers design appropriate teaching content to help learners study according to predicted outcomes. The purpose of this paper is to present a systematic literature review on predictions of students’ performance in higher education institutions and secondary schools using Machine Learning, Educational Data Mining, and Learning Analytics methodologies. The review used in this study was designed to: i) provide an overview of techniques and algorithms used to predict students' performance; and ii) identify the features that have the greatest impact on students' performance. This paper also outlined several future insights in terms of applying hybrid techniques to educational datasets in order to improve accuracy in predicting students’ performance.

Keyword

Educational data mining, Machine learning, Learning analytics, Students, Performance prediction.

Cite this article

Nawang H, Makhtar M, Hamza WM

Refference

[1][1]Gaftandzhieva S, Docheva M, Doneva R. A comprehensive approach to learning analytics in Bulgarian school education. Education and Information Technologies. 2021; 26(1):145-63.

[2][2]Alyahyan E, Düştegör D. Predicting academic success in higher education: literature review and best practices. International Journal of Educational Technology in Higher Education. 2020; 17(1):1-21.

[3][3]Ferreira SA, Andrade A. Academic analytics: anatomy of an exploratory essay. Education and Information Technologies. 2016; 21(1):229-43.

[4][4]Najafabadi MM, Villanustre F, Khoshgoftaar TM, Seliya N, Wald R, Muharemagic E. Deep learning applications and challenges in big data analytics. Journal of Big Data. 2015; 2(1):1-21.

[5][5]Khatib KC, Kamble TD, Chendake BR, Sonavane GN. Social media data mining for sentiment analysis. International Research Journal of Engineering and Technology. 2016; 3(4):373-6.

[6][6]Ozdemir D, Opseth HM, Taylor H. Leveraging learning analytics for student reflection and course evaluation. Journal of Applied Research in Higher Education. 2019; 12(1):27-37.

[7][7]Nuutila K, Tuominen H, Tapola A, Vainikainen MP, Niemivirta M. Consistency, longitudinal stability, and predictions of elementary school students task interest, success expectancy, and performance in mathematics. Learning and Instruction. 2018; 56:73-83.

[8][8]Lang C, Siemens G, Wise A, Gasevic D. Handbook of learning analytics. New York, NY, USA: SOLAR, Society for Learning Analytics and Research; 2017.

[9][9]Nawang H, Makhtar M, Shamsudin SN. Classification model and analysis on students performance. Journal of Fundamental and Applied Sciences. 2017; 9(6S):869-85.

[10][10]Tsai YS, Gasevic D. Learning analytics in higher education-challenges and policies: a review of eight learning analytics policies. In proceedings of the seventh international learning analytics & knowledge conference 2017 (pp. 233-42).

[11][11]Shahiri AM, Husain W. A review on predicting students performance using data mining techniques. Procedia Computer Science. 2015; 72:414-22.

[12][12]Kitchenham B, Brereton OP, Budgen D, Turner M, Bailey J, Linkman S. Systematic literature reviews in software engineering–a systematic literature review. Information and Software Technology. 2009; 51(1):7-15.

[13][13]Gil PD, Da CMS, Moro S, Costa JM. A data-driven approach to predict first-year students’ academic success in higher education institutions. Education and Information Technologies. 2021; 26(2):2165-90.

[14][14]Qazdar A, Er-raha B, Cherkaoui C, Mammass D. A machine learning algorithm framework for predicting students performance: a case study of baccalaureate students in Morocco. Education and Information Technologies. 2019; 24(6):3577-89.

[15][15]Costa-mendes R, Oliveira T, Castelli M, Cruz-jesus F. A machine learning approximation of the 2015 Portuguese high school student grades:a hybrid approach. Education and Information Technologies. 2021; 26(2):1527-47.

[16][16]Baars GJ, Stijnen T, Splinter TA. A model to predict student failure in the first year of the undergraduate medical curriculum. Health Professions Education. 2017; 3(1):5-14.

[17][17]Youssef M, Mohammed S, Hamada EK, Wafaa BF. A predictive approach based on efficient feature selection and learning algorithms’ competition: case of learners’ dropout in MOOCs. Education and Information Technologies. 2019; 24(6):3591-618.

[18][18]Kostopoulos G, Kotsiantis S, Verykios VS. A prognosis of junior high school students’ performance based on active learning methods. In international conference on brain function assessment in learning 2017 (pp. 67-76). Springer, Cham.

[19][19]Moreno-marcos PM, Pong TC, Munoz-merino PJ, Kloos CD. Analysis of the factors influencing learners’ performance prediction with learning analytics. IEEE Access. 2020; 8:5264-82.

[20][20]Al-obeidat F, Tubaishat A, Dillon A, Shah B. Analyzing students’ performance using multi-criteria classification. Cluster Computing. 2018; 21(1):623-32.

[21][21]Asif R, Merceron A, Ali SA, Haider NG. Analyzing undergraduate students performance using educational data mining. Computers & Education. 2017; 113:177-94.

[22][22]Yousafzai BK, Hayat M, Afzal S. Application of machine learning and data mining in predicting the performance of intermediate and secondary education level student. Education and Information Technologies. 2020; 25(6):4677-97.

[23][23]Adekitan AI, Noma-osaghae E. Data mining approach to predicting the performance of first year student in a university using the admission requirements. Education and Information Technologies. 2019; 24(2):1527-43.

[24][24]Azcona D, Hsiao IH, Smeaton AF. Detecting students-at-risk in computer programming classes with learning analytics from students’ digital footprints. User Modeling and User-Adapted Interaction. 2019; 29(4):759-88.

[25][25]Akçapınar G, Hasnine MN, Majumdar R, Flanagan B, Ogata H. Developing an early-warning system for spotting at-risk students by using eBook interaction logs. Smart Learning Environments. 2019; 6(1):1-15.

[26][26]Hussain S, Dahan NA, Ba-alwib FM, Ribata N. Educational data mining and analysis of students’ academic performance using WEKA. Indonesian Journal of Electrical Engineering and Computer Science. 2018; 9(2):447-59.

[27][27]Adekitan AI, Salau O. The impact of engineering students performance in the first three years on their graduation result using educational data mining. Heliyon. 2019; 5(2):1-21.

[28][28]Marbouti F, Diefes-dux HA, Madhavan K. Models for early prediction of at-risk students in a course using standards-based grading. Computers & Education. 2016; 103:1-15.

[29][29]Altujjar Y, Altamimi W, Al-turaiki I, Al-razgan M. Predicting critical courses affecting students performance: a case study. Procedia Computer Science. 2016; 82:65-71.

[30][30]Zhou Q, Quan W, Zhong Y, Xiao W, Mou C, Wang Y. Predicting high-risk students using internet access logs. Knowledge and Information Systems. 2018; 55(2):393-413.

[31][31]Aydoğdu Ş. Predicting student final performance using artificial neural networks in online learning environments. Education and Information Technologies. 2020; 25(3):1913-27.

[32][32]Karlos S, Kostopoulos G, Kotsiantis S. Predicting and interpreting students’ grades in distance higher education through a semi-regression method. Applied Sciences. 2020; 10(23):1-19.

[33][33]Iqbal MS, Luo B. Prediction of educational institution using predictive analytic techniques. Education and Information Technologies. 2019; 24(2):1469-83.

[34][34]Zohair LM. Prediction of students performance by modelling small dataset size. International Journal of Educational Technology in Higher Education. 2019; 16(1):1-8.

[35][35]Ma X, Zhou Z. Student pass rates prediction using optimized support vector machine and decision tree. In 8th annual computing and communication workshop and conference 2018 (pp. 209-15). IEEE.

[36][36]Hashim AS, Awadh WA, Hamoud AK. Student performance prediction model based on supervised machine learning algorithms. In IOP conference series: materials science and engineering 2020 (pp. 1-19). IOP Publishing.

[37][37]Hamsa H, Indiradevi S, Kizhakkethottam JJ. Student academic performance prediction model using decision tree and fuzzy genetic algorithm. Procedia Technology. 2016; 25:326-32.

[38][38]Pandey M, Taruna S. Towards the integration of multiple classifier pertaining to the students performance prediction. Perspectives in Science. 2016; 8:364-6.

[39][39]Badr G, Algobail A, Almutairi H, Almutery M. Predicting students performance in university courses: a case study and tool in KSU mathematics department. Procedia Computer Science. 2016; 82:80-9.

[40][40]Akçapınar G, Altun A, Aşkar P. Using learning analytics to develop early-warning system for at-risk students. International Journal of Educational Technology in Higher Education. 2019; 16(1):1-20.

[41][41]Hussain M, Zhu W, Zhang W, Abidi SM, Ali S. Using machine learning to predict student difficulties from learning session data. Artificial Intelligence Review. 2019; 52(1):381-407.

[42][42]Zheng G, Fancsali SE, Ritter S, Berman S. Using instruction-embedded formative assessment to predict state summative test scores and achievement levels in mathematics. Journal of Learning Analytics. 2019; 6(2):153-74.

[43][43]Rovira S, Puertas E, Igual L. Data-driven system to predict academic grades and dropout. PLoS One. 2017; 12(2).

[44][44]Rodríguez-muñiz LJ, Bernardo AB, Esteban M, Díaz I. Dropout and transfer paths: what are the risky profiles when analyzing university persistence with machine learning techniques?. Plos One. 2019; 14(6):1-21.

[45][45]Francis BK, Babu SS. Predicting academic performance of students using a hybrid data mining approach. Journal of Medical Systems. 2019; 43(6):1-5.

[46][46]Aiken JM, De BR, Hjorth-jensen M, Caballero MD. Predicting time to graduation at a large enrollment American university. Plos One. 2020; 15(11).

[47][47]Hussain M, Zhu W, Zhang W, Abidi SM. Student engagement predictions in an e-learning system and their impact on student course assessment scores. Computational Intelligence and Neuroscience. 2018:1-21.

[48][48]Czibula G, Mihai A, Crivei LM. S PRAR: a novel relational association rule mining classification model applied for academic performance prediction. Procedia Computer Science. 2019; 159:20-9.

[49][49]Matzavela V, Alepis E. Decision tree learning through a predictive model for student academic performance in intelligent M-learning environments. Computers and Education: Artificial Intelligence. 2021.

[50][50]Viloria A, López JR, Leyva DM, Vargas-mercado C, Hernández-palma H, Llinas NO, et al. Data mining techniques and multivariate analysis to discover patterns in university final researches. Procedia Computer Science. 2019; 155:581-6.

[51][51]Deng H, Wang X, Guo Z, Decker A, Duan X, Wang C, et al. Performancevis: visual analytics of student performance data from an introductory chemistry course. Visual Informatics. 2019; 3(4):166-76.

[52][52]Çetinkaya A, Baykan ÖK. Prediction of middle school students programming talent using artificial neural networks. Engineering Science and Technology, an International Journal. 2020; 23(6):1301-7.

[53][53]Mokhairi M, Nawang H, Wan SN. Analysis on students performance using naïve. Journal of Theoretical and Applied Information Technology. 2017; 31(16):3993-4000.

[54][54]Hu H, Zhang G, Gao W, Wang M. Big data analytics for MOOC video watching behavior based on Spark. Neural Computing and Applications. 2020; 32(11):6481-9.

[55][55]Slater S, Joksimović S, Kovanovic V, Baker RS, Gasevic D. Tools for educational data mining: a review. Journal of Educational and Behavioral Statistics. 2017; 42(1):85-106.

[56][56]Breiman L. Random forests. Machine Learning. 2001; 45(1):5-32.

[57][57]Yusuf A. Prediction of students’performance in E-learning environment using random forest. Doctoral Dissertation, University of Technology Malaysia.

[58][58]Noble WS. What is a support vector machine?. Nature Biotechnology. 2006; 24(12):1565-7.

[59][59]Gupta N. Artificial neural network. Network and Complex Systems. 2013; 3(1):24-8.

[60][60]Hamoud A, Hashim AS, Awadh WA. Predicting student performance in higher education institutions using decision tree analysis. International Journal of Interactive Multimedia and Artificial Intelligence. 2018; 5:26-31.

[61][61]Zulfiker MS, Kabir N, Biswas AA, Chakraborty P, Rahman MM. Predicting students performance of the private universities of Bangladesh using machine learning approaches. International Journal of Advanced Computer Science and Applications. 2020; 11(3):672-9.

[62][62]Sivakumar S, Venkataraman S, Selvaraj R. Predictive modeling of student dropout indicators in educational data mining using improved decision tree. Indian Journal of Science and Technology. 2016; 9(4):1-5.

[63][63]Rish I. An empirical study of the naive Bayes classifier. In IJCAI workshop on empirical methods in artificial intelligence 2001 (pp. 41-6).

[64][64]Sokkhey P, Okazaki T. Hybrid machine learning algorithms for predicting academic performance. International Journal of Advanced Computer Science and Applications. 2020; 11(1):32-41.

[65][65]Dole L, Rajurkar J. A Decision support system for predicting student performance. International Journal of Innovative Research in Computer and Communication Engineering. 2014; 2(12):7232-7.

[66][66]Mohamad M, Makhtar M, Abd RMN. The reconstructed heterogeneity to enhance ensemble neural network for large data. In international conference on soft computing and data mining 2016 (pp. 447-55). Springer, Cham