An Ensemble and Filtering-Based System for Predicting Educational Data Mining

Andhika Rafi Hananto, Silvia Anggun Rahayu, Taqwa Hariguna


When developing a prediction paradigm, an ensemble technique such as boosting is used. It is built on a heuristic framework. Generally speaking, engineering ensemble learning is more accurate than individual classifiers when it comes to making predictions. Consequently, numerous ensemble strategies have been presented in this work, particularly to provide a more complete understanding of the essential methods in general. Researchers have experimented with boosting methods to forecast student performance as part of a variety of ensemble techniques. The researchers employed improvement approaches to construct an accurate predictive educational model, which was based on a key phenomena seen in categorization and prediction operations. In light of the uniqueness and originality of the suggested strategy in educational data mining, the researchers used augmentation strategies in order to construct an accurate predictive pedagogical model. Tenfold cross-validation was performed to evaluate the effectiveness of the basic classifiers, which included the random tree, the j48, the knn, and the Naive Bayes. The random tree was found to be the most effective classifier. Several additional screening techniques, including oversampling (SMOTE) and undersampling (Spread subsampling), were utilized to analyze any statistically significant variations in results between the meta and base classifiers that had been identified between the meta and base classifiers. The use of ensemble and screening strategies, as compared to the use of standard classifiers, has demonstrated considerable gains in predicting student performance, as has the use of either strategy alone. Furthermore, after the completion of a performance research on each approach, two new prediction models have been established on the basis of the improved results gained thus far.

Article Metrics

Abstract: 167 Viewers PDF: 135 Viewers


Education Data Mining; J48, Naive Bayes; Random Tree; K-NN; Ensemble; Boosting

Full Text:



S. G. Suganeshwari, I. Ibrahim, and G. Li, “Lazy collaborative filtering with dynamic neighborhoods,” Inf. Discov. Deliv., vol. 46, no. 2, pp. 95–109, Jan. 2018, doi: 10.1108/IDD-02-2018-0007.

W. Zhang, “Decomposition based least squares iterative estimation algorithm for output error moving average systems,” Eng. Comput., vol. 31, no. 4, pp. 709–725, Jan. 2014, doi: 10.1108/EC-07-2012-0154.

F. Alyari and N. Jafari Navimipour, “Recommender systems,” Kybernetes, vol. 47, no. 5, pp. 985–1017, Jan. 2018, doi: 10.1108/K-06-2017-0196.

M. Taimoor and L. Aijun, “Neural-sliding mode approach-based adaptive estimation, isolation and tolerance of aircraft sensor fault,” Aircr. Eng. Aerosp. Technol., vol. 92, no. 2, pp. 237–255, Jan. 2020, doi: 10.1108/AEAT-05-2019-0106.

D. H. Zhu, Y. W. Wang, and Y. P. Chang, “The influence of online cross-recommendation on consumers’ instant cross-buying intention,” Internet Res., vol. 28, no. 3, pp. 604–622, Jan. 2018, doi: 10.1108/IntR-05-2017-0211.

P. Virdi, A. D. Kalro, and D. Sharma, “Consumer acceptance of social recommender systems in India,” Online Inf. Rev., vol. 44, no. 3, pp. 723–744, Jan. 2020, doi: 10.1108/OIR-05-2018-0177.

Y. Ding, X. Xiao, X. Huang, and J. Sun, “System identification and a model-based control strategy of motor driven system with high order flexible manipulator,” Ind. Robot Int. J. Robot. Res. Appl., vol. 46, no. 5, pp. 672–681, Jan. 2019, doi: 10.1108/IR-01-2019-0012.

A. Cezar and H. Ögüt, “Analyzing conversion rates in online hotel booking,” Int. J. Contemp. Hosp. Manag., vol. 28, no. 2, pp. 286–304, Jan. 2016, doi: 10.1108/IJCHM-05-2014-0249.

P. Virdi, A. D. Kalro, and D. Sharma, “Online decision aids: the role of decision-making styles and decision-making stages,” Int. J. Retail Distrib. Manag., vol. 48, no. 6, pp. 555–574, Jan. 2020, doi: 10.1108/IJRDM-02-2019-0068.

G. Ramaswami, T. Susnjak, A. Mathrani, J. Lim, and P. Garcia, “Using educational data mining techniques to increase the prediction accuracy of student academic performance,” Inf. Learn. Sci., vol. 120, no. 7/8, pp. 451–467, Jan. 2019, doi: 10.1108/ILS-03-2019-0017.

S. Sedkaoui and M. Khelfaoui, “Understand, develop and enhance the learning process with big data,” Inf. Discov. Deliv., vol. 47, no. 1, pp. 2–16, Jan. 2019, doi: 10.1108/IDD-09-2018-0043.

J. Ranjan and K. Malik, “Effective educational process: a data‐mining approach,” VINE, vol. 37, no. 4, pp. 502–515, Jan. 2007, doi: 10.1108/03055720710838551.

X. Du, J. Yang, J.-L. Hung, and B. Shelton, “Educational data mining: a systematic review of research and emerging trends,” Inf. Discov. Deliv., vol. 48, no. 4, pp. 225–236, Jan. 2020, doi: 10.1108/IDD-09-2019-0070.

G. Özdağoğlu, G. Z. Öztaş, and M. Çağliyangil, “An application framework for mining online learning processes through event-logs,” Bus. Process Manag. J., vol. 25, no. 5, pp. 860–886, Jan. 2019, doi: 10.1108/BPMJ-10-2017-0279.

X. Yang, “Influence of informational factors on purchase intention in social recommender systems,” Online Inf. Rev., vol. 44, no. 2, pp. 417–431, Jan. 2020, doi: 10.1108/OIR-12-2016-0360.

F. Afsahhosseini and Y. Al-Mulla, “Smart, hybrid and context-aware POI mobile recommender system in tourism in Oman,” J. Cult. Herit. Manag. Sustain. Dev., vol. ahead-of-print, no. ahead-of-print, Jan. 2021, doi: 10.1108/JCHMSD-08-2021-0148.

Y. Yu, Z. Wang, and C. Lu, “An extended Kalman particle filter for power system dynamic state estimation,” COMPEL - Int. J. Comput. Math. Electr. Electron. Eng., vol. 37, no. 6, pp. 1993–2005, Jan. 2018, doi: 10.1108/COMPEL-11-2017-0493.

D. Samara, I. Magnisalis, and V. Peristeras, “Artificial intelligence and big data in tourism: a systematic literature review,” J. Hosp. Tour. Technol., vol. 11, no. 2, pp. 343–367, Jan. 2020, doi: 10.1108/JHTT-12-2018-0118.

A. Ray, P. K. Bala, and R. Jain, “Utilizing emotion scores for improving classifier performance for predicting customer’s intended ratings from social media posts,” Benchmarking An Int. J., vol. 28, no. 2, pp. 438–464, Jan. 2021, doi: 10.1108/BIJ-01-2020-0004.

N. Tadi Bani and S. Fekri-Ershad, “Content-based image retrieval based on combination of texture and colour information extracted in spatial and frequency domains,” Electron. Libr., vol. 37, no. 4, pp. 650–666, Jan. 2019, doi: 10.1108/EL-03-2019-0067.

B. Zhang, G. Du, W. Shen, and F. Li, “Gesture-based human-robot interface for dual-robot with hybrid sensors,” Ind. Robot Int. J. Robot. Res. Appl., vol. 46, no. 6, pp. 800–811, Jan. 2019, doi: 10.1108/IR-11-2018-0245.

T. Guan and L. Duan, “Recovering pose and occlusion consistencies in augmented reality systems using affine properties,” Sens. Rev., vol. 30, no. 2, pp. 148–158, Jan. 2010, doi: 10.1108/02602281011022751.

C. G. Selvi and L. G. G. Priya, “Three-way formal concept clustering technique for matrix completion in recommender system,” Int. J. Pervasive Comput. Commun., vol. 17, no. 2, pp. 167–183, Jan. 2021, doi: 10.1108/IJPCC-07-2019-0055.

K. Wakil, F. Alyari, M. Ghasvari, Z. Lesani, and L. Rajabion, “A new model for assessing the role of customer behavior history, product classification, and prices on the success of the recommender systems in e-commerce,” Kybernetes, vol. 49, no. 5, pp. 1325–1346, Jan. 2020, doi: 10.1108/K-03-2019-0199.

S. Gul, S. Bano, and T. Shah, “Exploring data mining: facets and emerging trends,” Digit. Libr. Perspect., vol. 37, no. 4, pp. 429–448, Jan. 2021, doi: 10.1108/DLP-08-2020-0078.


  • There are currently no refbacks.


Journal of Applied Data Sciences

ISSN : 2723-6471 (Online)
Organized by : Departement of Information System, Universitas Amikom Purwokerto, Indonesia; Computer Science and Systems Information Technology, King Abdulaziz University, Kingdom of Saudi Arabia.
Website :
Email : (principal contact) (managing editor) (technical issues)

 This work is licensed under a Creative Commons Attribution-ShareAlike 4.0