Enrolment Management in Higher Learning Institutions: Student Retention Prediction

  • Joseph Ngemu Wote Technical Training Institute, Makueni, Kenya
Keywords: Business Intelligence, Retention, Attrition, WEKA, Classifiers


Student retention has become one of the most important priorities for decision makers in higher learning institutions (HLI). The increasing competition for students among tertiary institutions has resulted in a greater emphasis on student retention. Improving student retention starts with a thorough understanding of the reasons behind the attrition. In an effort to address this issue, the study used student demographic and institutional data along with several business intelligence (BI) techniques and analytical tools, to develop prototype to predict likelihood of student persistence or dropout with the goal to identify factors that can be used to identify students who are at risk of dropping out of tertiary institution program. This study used classification models generated using Waikato Environment for Knowledge Analysis (WEKA). The model was built using the 10-fold cross validation, and holdout method (60% of the data was used as training and the remaining as test and validation). Random sampling techniques were used in selecting the datasets. The attribute selection analysis of the models revealed that the student age on entry, parent occupation, health of student and financial variables are among the most important predictors of the phenomenon. Results of the classifiers were compared using accuracy level, confusion matrices and speed of model building benchmarks. The study shows that identifying the relevant student background factors can be incorporated to design a business intelligence system that can serve as valuable tool in predicting student withdrawal or persistence as well as recommend the necessary intervention strategies to adopt, leading to better education efficiency and graduation rate.

Article Views and Downloands Counter

Download data is not yet available.


Berson, A.; Smith, S. and Thearling, K. (2000). Building Data Mining Applications for CRM.New York: McGraw-Hill Professional Publishing.

Carey, K. (2004). A matter of degrees: Improving graduation rates in four-year colleges and universities. New York: The Education Trust.

Frank, E., Hall, M. A., (2011). Data Mining: Practical Machine Learning Tools and Techniquesǁ, 3rd Ed. Morgan Kaufmann.

Hämäläinen, W., & Vinni, M. (2010). Classifiers for educational technology. In C. Romero, S. Ventura, M., Pechenizkiy, R. S., Baker, J. D., (eds.), Handbook of Educational Data Mining, (pp. 54-74). CRC Press.

Watson, H. J., & Wixom, B. H.,(2007). The Current State of Business Intelligence, Computer, vol.40, no. 9, pp. 96-99, September 2007, doi:10.1109/MC.2007.331

Hall, M., Frank, E, Holmes, G., Pfahringer, B., Reutemann, P., Witten, I. H., (2009). The WEKA data mining software: An update. ACM SIGKDD explorations newsletter.

Olszak, C. M., & Ziemba, E. (2004). Business intelligence systems as a new generation of decision support systems. Proceedings PISTA 2004, International Conference on Politics and Information Systems: Technologies and Applications. Orlando: The International Institute of Informatics and Systemic.

Tinto, V. (1975). Dropout from Higher Education: A Theoretical Synthesis of Recent Research. Review of Educational Research vol.45, pp.89-125.
How to Cite
Ngemu, J. (2023). Enrolment Management in Higher Learning Institutions: Student Retention Prediction. Africa Journal of Technical and Vocational Education and Training, 8(1), 78-90. https://doi.org/10.69641/afritvet.2023.81162