A Brief Overview of the Accuracy of Classification Algorithms for Data Prediction in Machine Learning Applications

Lichung Jen, Yu-Hsiang Lin

Abstract


Many business applications rely on their history data to anticipate their company future. The marketing products process is one of the essential procedures for the firm. Customer needs supply a useful piece of information that helps to promote the suitable products at the proper moment. Moreover, services are recognized recently as products. The development of education and health services is reliant on historical data. For the more, lowering online social media networks problems and crimes need a big supply of information. Data analysts need to utilize an efficient categorization system to predict the future of such businesses. However, dealing with a vast quantity of data demands tremendous time to process. Data mining encompasses numerous valuable techniques that are used to anticipate statistical data in a number of business applications. The classification technique is one of the most extensively utilized with a range of algorithms. In this work, numerous categorization methods are revised in terms of accuracy in diverse domains of data mining applications. A complete analysis is done following delegated reading of 20 papers in the literature. This study intends to allow data analysts to identify the best suitable classification algorithm for numerous commercial applications including business in general, online social media networks, agriculture, health, and education. Results reveal FFBPN is the best accurate algorithm in the business arena. The Random Forest algorithm is the most accurate in categorizing online social networks (OSN) activity. Naïve Bayes method is the most accurate to classify agriculture datasets. OneR is the most accurate method to classify occurrences inside the health domain. The C4.5 Decision Tree method is the most accurate to classify students’ records to forecast degree completion time.


Article Metrics

Abstract: 347 Viewers PDF: 211 Viewers

Keywords


Data Prediction Techniques; Accuracy; Classification Algorithms; Data Mining Applications

Full Text:

PDF


References


Harkiran, K. (2017) A Study On Data Mining Techniques And Their Areas Of Application. International Journal of Recent Trends in Engineering and Research, 3, 93-95. https://doi.org/10.23883/IJRTER.2017.3393.EO7O3

Silva, J., Borré, J.R., Castillo, A.P.P., Castro, L. and Varela, N. (2019) Integration of Data Mining Classification Techniques and Ensemble Learning for Predicting the Export Potential of a Company. Procedia Computer Science, 151, 1194-1200. https://doi.org/10.1016/j.procs.2019.04.171

Sirisup, C. and Songmuang, P. (2018) Exploring Efficiency of Data Mining Techniques for Missing Link in Online Social Network. 2018 International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP), Pattaya, 15-17 November 2018. https://doi.org/10.1109/iSAI-NLP.2018.8692951

Chiranjeevi, M.N. and Nadagoudar, R.B. (2018) Analysis of Soil Nutrients Using Data Mining Techniques. International Journal of Recent Trends in Engineering and Research, 4, 103-107. https://doi.org/10.23883/IJRTER.2018.4363.PDT1C

Jeong, K., Hong, T., Chae, M. and Kim, J. (2019) Development of a Decision Support Model for Determining the Target Multi-Family Housing Complex for Green Remodeling Using Data Mining Techniques. Energy and Buildings, 202, Article ID: 109401. https://doi.org/10.1016/j.enbuild.2019.109401

Kaur, B., Ahuja, L. and Kumar, V. (2019) Crime against Women: Analysis and Prediction Using Data Mining Techniques. International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), 14-16 February 2019, Faridabad. https://doi.org/10.1109/COMITCon.2019.8862195

Mia, M.R., Hossain, S.A., Chhoton, A.C. and Chakraborty, N.R. (2018) A Comprehensive Study of Data Mining Techniques in Health-Care, Medical, and Bioinformatics. International Conference on Computer, Communication, Chemical, Material and Electronic Engineering (IC4ME2), Rajshahi, 8-9 February 2018. https://doi.org/10.1109/IC4ME2.2018.8465626

Amornsinlaphachai, P. (2016) Efficiency of Data Mining Models to Predict Academic Performance and a Cooperative Learning Model. 8th International Conference on Knowledge and Smart Technology (KST), Chiang Mai, 3-6 February 2016. https://doi.org/10.1109/KST.2016.7440483

Roy, S. and Garg, A. (2017) Analyzing Performance of Students by Using Data Mining Techniques: A Literature Survey. 4th IEEE Uttar Pradesh Section International Conference on Electrical, Computer and Electronics (UPCON), Mathura, 26-28 October 2017. https://doi.org/10.1109/UPCON.2017.8251035

Khongchai, P. and Songmuang, P. (2017) Implement of Salary Prediction System to Improve Student Motivation Using Data Mining Technique. 11th International Conference on Knowledge, Information and Creativity Support Systems (KICSS), Yogyakarta, 10-12 November 2016. https://doi.org/10.1109/KICSS.2016.7951419

Besimi, N., Cico, B. and Besimi, A. (2017) Overview of Data Mining Classification Techniques: Traditional vs. Parallel/Distributed Programming Models. Proceedings of the 6th Mediterranean Conference on Embedded Computing, Bar, 11-15 June 2017, 1-4. https://doi.org/10.1109/MECO.2017.7977126

Kumar, S.R., Jassi, J.S., Yadav, S.A. and Sharma, R. (2016) Data-Mining a Mechanism against Cyber Threats: A Review. International Conference on Innovation and Challenges in Cyber Security (ICICCS-INBUSH), Greater Noida, 3-5 February 2016. https://doi.org/10.1109/ICICCS.2016.7542343

Thongsatapornwatana, U. (2016) A Survey of Data Mining Techniques for Analyzing Crime Patterns. Second Asian Conference on Defence Technology (ACDT), Chiang Mai, 21-23 January 2016. https://doi.org/10.1109/ACDT.2016.7437655

Kaur, S. and Bawa, R.K. (2017) Data Mining for diagnosis in Healthcare Sector-a review, International Journal of Advances in Scientific Research and Engineering.

T. wahyuningsih, “Text Mining an Automatic Short Answer Grading (ASAG), Comparison of Three Methods of Cosine Similarity, Jaccard Similarity and Dice’s Coefficient,” J. Appl. Data Sci., vol. 2, no. 2, pp. 45–54, 2021, doi: 10.47738/jads.v2i2.31.

T. T. Kim Phuong, “Proposing a Theoretical Model to Determine Factors Affecting on Job Satisfaction, Job Performance and Employees Loyalty For Technology Information (IT) Workers,” Int. J. Appl. Inf. Manag., vol. 1, no. 4, pp. 201–209, 2021, doi: 10.47738/ijaim.v1i4.21.

W.-J. Su, “The Effects of Safety Management Systems, Attitude and Commitment on Safety Behaviors and Performance,” Int. J. Appl. Inf. Manag., vol. 1, no. 4, pp. 187–199, 2021, doi: 10.47738/ijaim.v1i4.20.

Vaishali, S., Parsania, N., Jani, N. and Bhalodiya, N.H. (2014) Applying Naïve Bayes, BayesNet, PART, JRip and OneR Algorithms on Hypothyroid Database for Comparative Analysis. International Journal of Darshan Institute on Engineering Research & Emerging Technologies, 3, 60-64.

Jalota, C. and Agrawal, R. (2019) Analysis of Educational Data Mining using Classification. International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), Faridabad, 14-16 February 2019. https://doi.org/10.1109/COMITCon.2019.8862214

Al-Nadabi, S.S. and Jayakumari, C. (2019) Predict the Selection of Mathematics Subject for 11th Grade Students Using Data Mining Technique. 4th MEC International Conference on Big Data and Smart City (ICBDSC), Muscat, 15-16 January 2019. https://doi.org/10.1109/ICBDSC.2019.8645594

Wati, M., Haeruddin and Indrawan, W. (2017) Predicting Degree-Completion Time with Data Mining. 3rd International Conference on Science in Information Technology (ICSITech), Bandung, 25-26 October 2017. https://doi.org/10.1109/ICSITech.2017.8257209

Anoopkumar, M. andZubair Rahman, A.M.J.Md. (2016) A Review on Data Mining Techniques and Factors Used in Educational Data Mining to Predict Student Amelioration, International Conference on Data Mining and Advanced Computing (SAPIENCE), Ernakulam, 16-18 March 2016.

Rambola, R.K., Inamke, M. and Harne, S. (2018) Literature Review: Techniques and Algorithms Used for Various Applications of Educational Data Mining (EDM). 4th International Conference on Computing Communication and Automation (ICCCA), Greater Noida, 14-15 December 2018. https://doi.org/10.1109/CCAA.2018.8777556


Refbacks

  • There are currently no refbacks.



Barcode

Journal of Applied Data Sciences

ISSN : 2723-6471 (Online)
Organized by : Departement of Information System, Universitas Amikom Purwokerto, Indonesia; Computer Science and Systems Information Technology, King Abdulaziz University, Kingdom of Saudi Arabia.
Website : http://bright-journal.org/JADS
Email : taqwa@amikompurwokerto.ac.id (principal contact)
    husniteja@uinjkt.ac.id (managing editor)
    support@bright-journal.org (technical issues)

 This work is licensed under a Creative Commons Attribution-ShareAlike 4.0