Data Mining Implementation with Algorithm C4.5 for Predicting Graduation Rate College Student

Jeffri Prayitno Bangkit Saputra, Retno Waluyo


Academic evaluation and graduation of students are critical components of an academic information system's (AIS) effectiveness since they allow for the measurement of student learning progress. Additionally, the assessment stating whether the student passed or failed would benefit both the student and teacher by acting as a reference point for future performance suggestions and evaluations. Using Decision Tree C4.5, a comprehensive analysis of the student academic evaluation approach was conducted. Age, gender, public or private high school status, high school department, organization activity, age at high school admission, progress GPA (pGPA), and total GPA (tGPA) were all documented and evaluated from semester 1–4 utilizing three times the graduation criterion periods. The article's scope is confined to undergraduate programs. An accuracy algorithm (AC) with a performance accuracy of 79.60 percent, a true positive rate (TP) of 77.70 percent, and 91 percent quality training data achieved the highest performance accuracy value.

Article Metrics

Abstract: 0 Viewers PDF: 0 Viewers


Data Mining; C4.5; Education; Graduation Prediction

Full Text:



A. I. Adekitan and O. Salau, “The impact of engineering students’ performance in the first three years on their graduation result using educational data mining,” Heliyon, vol. 5, no. 2, p. e01250, 2019, doi: 10.1016/j.heliyon.2019.e01250.

C. K. Lo, H. C. Chen, P. Y. Lee, M. C. Ku, L. Ogiela, and C. H. Chuang, “Smart dynamic resource allocation model for patient-driven mobile medical information system using C4.5 algorithm,” J. Electron. Sci. Technol., vol. 17, no. 3, pp. 231–241, 2019, doi: 10.11989/JEST.1674-862X.71018117.

H. Bin Wang and Y. J. Gao, “Research on C4.5 algorithm improvement strategy based on MapReduce,” Procedia Comput. Sci., vol. 183, pp. 160–165, 2021, doi: 10.1016/j.procs.2021.02.045.

L. G. Plata, C. G. Ramos, M. L. Silva Oliveira, and L. F. Silva Oliveira, “Release kinetics of multi-nutrients from volcanic rock mining by-products: Evidences for their use as a soil remineralizer,” J. Clean. Prod., vol. 279, p. 123668, 2021, doi: 10.1016/j.jclepro.2020.123668.

G. Lesinski and S. Corns, “Multi-objective evolutionary neural network to predict graduation success at the United States Military Academy,” Procedia Comput. Sci., vol. 140, pp. 196–205, 2018, doi: 10.1016/j.procs.2018.10.329.

C. Grac, X. Dolques, A. Braud, M. Trémolières, J. N. Beisel, and F. Le Ber, “Mining the sequential patterns of water quality preceding the biological status of waterbodies,” Ecol. Indic., vol. 130, 2021, doi: 10.1016/j.ecolind.2021.108070.

J. de J. Costa, F. Bernardini, D. Artigas, and J. Viterbo, “Mining direct acyclic graphs to find frequent substructures — An experimental analysis on educational data,” Inf. Sci. (Ny)., vol. 482, pp. 266–278, 2019, doi: 10.1016/j.ins.2019.01.032.

L. Bonilla Mejía, “Mining and human capital accumulation: Evidence from the Colombian gold rush,” J. Dev. Econ., vol. 145, no. July 2019, 2020, doi: 10.1016/j.jdeveco.2020.102471.

K. R. Pradeep and N. C. Naveen, “Lung Cancer Survivability Prediction based on Performance Using Classification Techniques of Support Vector Machines, C4.5 and Naive Bayes Algorithms for Healthcare Analytics,” Procedia Comput. Sci., vol. 132, pp. 412–420, 2018, doi: 10.1016/j.procs.2018.05.162.

M. Sharma, S. Joshi, and K. Govindan, “Issues and solutions of electronic waste urban mining for circular economy transition: An Indian context,” J. Environ. Manage., vol. 290, no. October 2020, p. 112373, 2021, doi: 10.1016/j.jenvman.2021.112373.

V. Fernandez, “Innovation in the global mining sector and the case of Chile,” Resour. Policy, vol. 68, no. January, p. 101690, 2020, doi: 10.1016/j.resourpol.2020.101690.

A. Hira and J. Busumtwi-Sam, “Improving mining community benefits through better monitoring and evaluation,” Resour. Policy, vol. 73, no. December 2020, p. 102138, 2021, doi: 10.1016/j.resourpol.2021.102138.

R. Benkercha and S. Moulahoum, “Fault detection and diagnosis based on C4.5 decision tree algorithm for grid connected PV system,” Sol. Energy, vol. 173, no. July, pp. 610–634, 2018, doi: 10.1016/j.solener.2018.07.089.

B. Sen and E. Ucar, “Evaluating the achievements of computer engineering department of distance education students with data mining methods,” Procedia Technol., vol. 1, pp. 262–267, 2012, doi: 10.1016/j.protcy.2012.02.053.

P. Dabhade, R. Agarwal, K. P. Alameen, A. T. Fathima, R. Sridharan, and G. Gopakumar, “Educational data mining for predicting students’ academic performance using machine learning algorithms,” Mater. Today Proc., no. xxxx, 2021, doi: 10.1016/j.matpr.2021.05.646.

R. Campagni, D. Merlini, R. Sprugnoli, and M. C. Verri, “Data mining models for student careers,” Expert Syst. Appl., vol. 42, no. 13, pp. 5508–5521, 2015, doi: 10.1016/j.eswa.2015.02.052.

X. Meng, P. Zhang, Y. Xu, and H. Xie, “Construction of decision tree based on C4.5 algorithm for online voltage stability assessment,” Int. J. Electr. Power Energy Syst., vol. 118, no. December 2019, p. 105793, 2020, doi: 10.1016/j.ijepes.2019.105793.

G. Lesinski, S. Corns, and C. Dagli, “Application of an Artificial Neural Network to Predict Graduation Success at the United States Military Academy,” Procedia Comput. Sci., vol. 95, pp. 375–382, 2016, doi: 10.1016/j.procs.2016.09.348.

S. J. Lee, Z. Xu, T. Li, and Y. Yang, “A novel bagging C4.5 algorithm based on wrapper feature selection for supporting wise clinical decision making,” J. Biomed. Inform., vol. 78, pp. 144–155, 2018, doi: 10.1016/j.jbi.2017.11.005.


  • There are currently no refbacks.


Journal of Applied Data Sciences

2723-6471 (Online)
Published by Bright Publisher
Puri Mersi Baru, Jl.Martadireja II, Gang Sitihingil 3 Blok A No 2, Purwokerto Timur, Jawa Tengah
Website :
Email :

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0