A model for improved performance prediction using ensemble-based hybrid classification approach on a multivariate student dataset

  • Authors

    • M Anoopkumar Research Scholar,Research and Development Centre,Bharathiar University, Coimbatore - 46,Tamilnadu, INDIA http://orcid.org/0000-0002-1473-5396
    • A. M. J. Md. Zubair Rahman Principal, Head of the Institution, Al-Ameen Engineering College, Erode, Tamilnadu, India
    https://doi.org/10.14419/ijet.v7i4.23542

    Received date: December 9, 2018

    Accepted date: March 24, 2019

    Published date: May 9, 2018

  • Classification, Ensemble-based Hybrid Classification, EHCA, performance prediction, Educational Data Mining(EDM)
  • Abstract

    Classification techniques have sensed substantial attention in Information Engineering and Technology for the performance prediction and optimisation since few decades. The discovered accuracy of the Classification Model helps the institutional practices and student’s performances. In this paper, a novel Ensemble-based Hybrid Classification Approach (EHCA) has been proposed to be managed to produce improved performance prediction. The mining process with new attributes based on student behaviours has also been incorporated, since it creates a great impact on their academic performances. Moreover, the performance of the students is analysed with a set of classifiers in Educational Data Mining (EDM) namely, Naive Bayesian, Support-Vector-Machine (SVM) and J48. Futuristic-bound Ensemble approach is employed for enhancing the classifier performances. Here, the futuristic methods of ensembles of Bagging, Classification Boosting and Stacking are used for optimising the results with more precision. Further, the process of Ensemble-based Hybrid Classification is analysed and tested with the dataset collected from Kerala Technological Univer-sity-SNG College of Engineering (KTU_SNG). The results obtained are compared with the results obtained for utilized single classi-fiers and the EHCA on the basis of performance efficiency and classification accuracy. The work evidence the efficiency of the pro-posed approach and proves its reliability in Profound Performance Prediction and Optimisation.

  • References

    1. Aakash Tiwari, Aditya Prakash, "Improving classification of J48 al-gorithm using bagging, boosting and blending ensemble methods on SONAR dataset using WEKA”, Int. J. of Engg. and Tech.l Res. (IJETR) ISSN: 2321-0869, Volume-2, Issue-9, September 2014, pp. 207-209.
    2. Agudo-Peregrina, Á.F., Iglesias-Pradas, S., Conde-González, M.Á. and Hernández-García, Á. (2014), “Can We Predict Success from Log Data in VLEs? Classification of Interactions for Learning Ana-lytics and Their Relationship with Performance in VLE-Supported F2F and Online Learning”, Comput.s in Hum. Behav., Vol. 31, pp. 542-550. https://doi.org/10.1016/j.chb.2013.05.031.
    3. Akanksha Ahlawat1, Bharti, Bharti Suri, “Improving Classification in Data Mining Using Hybrid Algorithm”, IEEE Trans. on Know. and Data Engg, VOL. 17, pp.237-246.
    4. Anoopkumar M and A. M. J. Md. Zubair Rahman, “A Review on Data Mining Techniques and Factors Used in Educational Data Min-ing to Predict Student Amelioration” 2016 IEEE Int. Conf. on Data Min. and Adv. Comput.g (SAPIENCE), March-2016 pp. 122-133. https://doi.org/10.1109/SAPIENCE.2016.7684113.
    5. Anoopkumar M and A. M. J. Md. Zubair Rahman, “Model of Tuned J48 Classification and Analysis of Performance Prediction in Educa-tional Data Mining”, (IJAER) Int. J. of Applied Engg. Res. ISSN 0973-4562 Volume 13, Number 20 (2018) pp. 14717-14727.
    6. Anoopkumar M and A. M. J. Md. Zubair Rahman, “Bound Model of Clustering and Classification (BMCC) for Proficient Performance Prediction of Didactical Outcomes of Students” (IJACSA) Int. J. of Adv. Comput. Sci. and Appl.s, Vol. 9, No. 11, 2018, pp. 1-9. https://doi.org/10.14569/IJACSA.2018.091133.
    7. Ayon Sen, Md. Monirul Islam, Kazuyuki Murase, and Xin Yao, “Binarisation with Boosting and Oversampling for Multiclass Classi-fication”, IEEE Trans. on Cybernet.s, Vol. 46, No. 5, May 2016, pp. 1078-1091. https://doi.org/10.1109/TCYB.2015.2423295.
    8. Cerezo, R., Sánchez-Santillán, M., Paule-Ruiz, M.P. and Núñez, J.C. (2016), “Students’ LMS Interaction Patterns and Their Relationship with Achievement: A Case Study in Higher Education”, Comput.s & Edu., Vol. 96, pp. 42-54. https://doi.org/10.1016/j.compedu.2016.02.006.
    9. Cufoglu, M. Lohi and K. Madani, "A Comparative Study of Select-ed Classifiers with Classification Accuracy in User Profiling," 2009 WRI World Congress on Computer Science and Information Engi-neering, Los Angeles, CA, 2009, pp. 708-712. doi: 10.1109/CSIE.2009.954. https://doi.org/10.1109/CSIE.2009.954.
    10. D. L. Gupta, A. K. Malviya, Satyendra Singh, "Performance Analy-sis of Classification Tree Learning Algorithms", Int. J. of Comp. Ap-pli. (0975 – 8887), Volume 55– No.6, October 2012. https://doi.org/10.5120/8762-2680.
    11. Fariba, T.B. (2013), “Academic Performance of Virtual Students Based on Their Personality Traits, Learning Styles and Psychological Wellbeing: A Prediction”, Proc.-Soc.l and Behav.l Sci., Vol. 84, pp. 112-116. https://doi.org/10.1016/j.sbspro.2013.06.519.
    12. G. Kaur and A. Chhabra, “Improved J48 Classification Algorithm for the Prediction of Diabetes,” Int. J. Comput. Appl., vol. 98, no. 22, pp. 13–17, 2014. https://doi.org/10.5120/17314-7433.
    13. Gray G., Mcguinness C., Owende P. (2016) “Non-Cognitive Factors of Learning as Early Indicators of Students At-Risk of Failing in Ter-tiary Education. In: Khine M.S., Areepattamannil S. (eds) Non-cognitive Skills and Factors in Educational Attainment”. Contem.y Approach. to Res. in Learn. Innov.s. Sense Publishers, Rotterdam, pp. 199-237. https://doi.org/10.1007/978-94-6300-591-3_10.
    14. Hina Anwar, Usman Qamar, and AbdulWahab Muzaffar Qureshi, “Global Optimization Ensemble Model for Classification Methods”, Hindawi Publishing Corp. Sci.c Worl. J. Vol 2014, Article ID 313164, pp. 1-9. https://doi.org/10.1155/2014/313164.
    15. Hoe, A.C.K., Ahmad, M.S., Hooi, T.C., Shanmugam, M., Gun-asekaran, S.S., Cob, Z.C. and Ramasamy, A. (2014), “Analyzing Students’ Records to Identify Patterns of Students’ Performance”, Res. and Inno. in Info. Sys. (ICRIIS), 2013 Int. Conf. on IEEE, Kua-la Lumpur, pp. 544-547. https://doi.org/10.1109/ICRIIS.2013.6716767.
    16. Ikbal, S., Tamhane, A., Sengupta, B., Chetlur, M., Ghosh, S. and Appleton, J. (2015), “On early prediction of risks in academic per-formance for students”, IBM J. of Res. and Develop.t, Vol. 59. No. 6, pp. 1-5. https://doi.org/10.1147/JRD.2015.2458631.
    17. Kotsiantis, S.B. and Pintelas, P.E. (2005), “Pred. Students’ Marks in Hellenic Open University”, Adv. Learn. Technol.s, ICALT 2005, 5th IEEE Int. Conf. on IEEE, Washington, DC, July 5-8, pp. 664-668.
    18. M. S. Halawa, M. E. Shehab and E. M. R. Hamed, "Predicting stu-dent personality based on a data-driven model from student behavior on LMS and social networks," 2015 Fifth International Conference on Digital Information Processing and Communications (ICDIPC), in IEEE, Sierre, 2015, pp. 294-299. https://doi.org/10.1109/ICDIPC.2015.7323044.
    19. Md. Rajib Hasan, Fadzilah Siraj, and Mohd Shamrie Sainin, “Im-proving ensemble decision tree performance using Adaboost and Bagging” AIP Conf. Proc.s 1691, 030008 (2015). https://doi.org/10.1063/1.4937027.
    20. Minaei-Bidgoli, B., Kashy, D.A., Kortemeyer, G. and Punch, W.F. (2003), “Predicting Student Performance: An Application of Data Mining Methods with an Educational Web-Based System”, IEEE, Front.s in Edu. FIE 2003 33rd Annual, Westminster, CO. Vol. 1, pp. 1-13. https://doi.org/10.1109/FIE.2003.1263284.
    21. Oladokun, V.O., Adebanjo, A.T. and Charles-Owaba, O.E. (2008), “Predicting Students’ Academic Performance Using Artificial Neural Network: A Case Study of an Engineering Course”, The Pacific J. of Sci. and Technol., Vol. 9. No. 1, pp. 72-79.
    22. Prerna Kapoor1, Reena Rani, “Efficient Decision Tree Algorithm Using J48 and Reduced Error Pruning” Int. J. of Engg. Res. and Gen. Sci. Volume 3, Issue 3, May-June 2015, pp. 1613-1621.
    23. Rokach, L. “Ensemble-based classifiers,” Artif. Intelli. Rev., Vol. 33, pp. 1-39, 2010. https://doi.org/10.1007/s10462-009-9124-7.
    24. Romero, C., López, M.I., Luna, J.M. and Ventura, S. (2013), “Pre-dicting Students’ Final Performance from Participation in On-Line Discussion Forums”, Comput.s & Edu.n, Vol. 68, pp. 458-472. https://doi.org/10.1016/j.compedu.2013.06.009.
    25. Sarker, F., Tiropanis, T. and Davis, H.C. (2013), “Exploring Student Predictive Model That Relies on Institutional Databases and Open Data Instead of Traditional Questionnaires”, Proc.s of the 22nd Int. Conf. on WWW. ACM, pp. 413-418. https://doi.org/10.1145/2487788.2487955.
    26. Sembiring, S., Zarlis, M., Hartama, D., Ramliana, S. and Wani, E. (2011), “Prediction of Student Academic Performance by an Appli-cation of Data Mining Techniques”, Inte. Conf. on Manag.t and Ar-tif.l Intelli. IPEDR, Vol. 6, pp. 110-114.
    27. Swamy, M.N. and Hanumanthappa, M. (2012), “Predicting Aca-demic Success from Student Enrolment Data Using Decision Tree Technique”, Int. J. of App. Info. Sys., Vol. 4. No. 3, pp. 1-6. https://doi.org/10.5120/ijais12-450654.
    28. Trstenjak, B. and Donko, D. (2014), “Determining the Impact of Demographic Features in Predicting Student Success in Croatia”, 37th Int. Conv.n on Info. and Comm.n Tech., Electronics and Mi-croelectronics (MIPRO), IEEE, pp. 1222-1227. https://doi.org/10.1109/MIPRO.2014.6859754.
    29. Wang, X. “Modeling Entrance into STEM Fields of Study among Students Beginning at Beginning at Community Colleges and Four-Year Institutions,” Res. in High. Edu.n, 54 (6), 664-669, September 2013. https://doi.org/10.1007/s11162-013-9291-x.
    30. Yaswanth Kumar Alapati, “Combining Clustering with Classifica-tion: A Technique to Improve Classification Accuracy” Int. J. of Comput. Sci. Engg. (IJCSE), Vol. 5 No.06 Nov 2016, pp. 336- 338.
    31. Zhi-Hua Zhou and Yuan Jiang, “NeC4.5: Neural Ensemble Based C4.5”, IEEE Trans. on Knowl. and Data Engg., VOL. 16, NO. 6, Jun 2004, pp. 770-773. https://doi.org/10.1109/TKDE.2004.11.
  • Downloads

  • How to Cite

    Anoopkumar, M., & Zubair Rahman, A. M. J. M. (2018). A model for improved performance prediction using ensemble-based hybrid classification approach on a multivariate student dataset. International Journal of Engineering and Technology, 7(4). https://doi.org/10.14419/ijet.v7i4.23542