A model for improved performance prediction using ensemble-based hybrid classification approach on a multivariate student dataset

  • Abstract
  • Keywords
  • References
  • PDF
  • Abstract

    Classification techniques have sensed substantial attention in Information Engineering and Technology for the performance prediction and optimisation since few decades. The discovered accuracy of the Classification Model helps the institutional practices and student’s performances. In this paper, a novel Ensemble-based Hybrid Classification Approach (EHCA) has been proposed to be managed to produce improved performance prediction. The mining process with new attributes based on student behaviours has also been incorporated since it creates a great impact on their academic performances. Moreover, the performance of the students is analysed with a set of classifiers in Educational Data Mining (EDM) namely, Naive Bayesian, Support-Vector-Machine(SVM) and J48. Additionally, Ensemble approach is employed for enhancing the classifier performances. Here, the basic Ensemble methods such as Bagging, Classification Boosting and Stacking are used for optimising the results with more precision. Further, the process of Ensemble-based Hybrid Classification is analysed and tested with the dataset collected from Kerala Technological University-SNG College of Engineering(KTU_SNG). The results obtained are compared with the results obtained for utilized single classifiers and the EHCA on the basis of performance efficiency and classification accuracy. The work evidences the efficiency of the proposed approach and proves its reliability in Profound Performance Prediction and Optimisation.

  • Keywords

    Classification, Ensemble-based Hybrid Classification, EHCA, performance prediction, Educational Data Mining(EDM)

  • References

      [1] Aakash Tiwari, Aditya Prakash, "Improving classification of J48 algorithm using bagging, boosting and blending ensemble methods on SONAR dataset using WEKA”, Int. J. of Engg. and Tech.l Res. (IJETR) ISSN: 2321-0869, Volume-2, Issue-9, September 2014, pp. 207-209.

      [2] Agudo-Peregrina, Á.F., Iglesias-Pradas, S., Conde-González, M.Á. and Hernández-García, Á. (2014), “Can We Predict Success from Log Data in VLEs? Classification of Interactions for Learning Analytics and Their Relation with Performance in VLE-Supported F2F and Online Learning”, Comput.s in Hum. Behav., Vol. 31, pp. 542-550.

      [3] Akanksha Ahlawat1, Bharti, Bharti Suri, “Improving Classification in Data Mining Using Hybrid Algorithm”, IEEE Trans. on Know. and Data Engg, VOL. 17, pp.237-246.

      [4] Anoopkumar M and A. M. J. Md. Zubair Rahman, “A Review on Data Mining Techniques and Factors Used in Educational Data Mining to Predict Student Amelioration” 2016 IEEE Int. Conf. on Data Min. and Adv. Comput.g (SAPIENCE), March-2016 pp. 122-133.

      [5] Anoopkumar M and A. M. J. Md. Zubair Rahman, “Model of Tuned J48 Classification and Analysis of Performance Prediction in Educational Data Mining”, (IJAER) Int. J. of Applied Engg. Res. ISSN 0973-4562 Volume 13, Number 20 (2018) pp. 14717-14727.

      [6] Anoopkumar M and A. M. J. Md. Zubair Rahman, “Bound Model of Clustering and Classification (BMCC) for Proficient Performance Prediction of Didactical Outcomes of Students” (IJACSA) Int. J. of Adv. Comput. Sci. and Appl.s, Vol. 9, No. 11, 2018, pp. 1-9.

      [7] Ayon Sen, Md. Monirul Islam, Kazuyuki Murase, and Xin Yao, “Binarisation with Boosting and Oversampling for Multiclass Classification”, IEEE Trans. on Cybernet.s, Vol. 46, No. 5, May 2016, pp. 1078-1091.

      [8] Cerezo, R., Sánchez-Santillán, M., Paule-Ruiz, M.P. and Núñez, J.C. (2016), “Students’ LMS Interaction Patterns and Their Relationship with Achievement: A Case Study in Higher Education”, Comput.s & Edu., Vol. 96, pp. 42-54.

      [9] Cufoglu, M. Lohi and K. Madani, "A Comparative Study of Selected Classifiers with Classification Accuracy in User Profiling," 2009 WRI World Congress on Computer Science and Information Engineering, Los Angeles, CA, 2009, pp. 708-712. doi: 10.1109/CSIE.2009.954.

      [10] D. L. Gupta, A. K. Malviya, Satyendra Singh, "Performance Analysis of Classification Tree Learning Algorithms", Int. J. of Comp. Appli. (0975 – 8887), Volume 55– No.6, October 2012.

      [11] Fariba, T.B. (2013), “Academic Performance of Virtual Students Based On Their Personality Traits, Learning Styles and Psychological Wellbeing: A Prediction”, Proc.-Soc.l and Behav.l Sci., Vol. 84, pp. 112-116.

      [12] G. Kaur and A. Chhabra, “Improved J48 Classification Algorithm for the Prediction of Diabetes,” Int. J. Comput. Appl., vol. 98, no. 22, pp. 13–17, 2014.

      [13] Gray G., Mcguinness C., Owende P. (2016) “Non-Cognitive Factors of Learning as Early Indicators of Students at-Risk of Failing in Tertiary Education. In: Khine M.S., Areepattamannil S. (eds) Non-cognitive Skills and Factors in Educational Attainment”. Contem.y Approach. to Res. in Learn. Innov.s. Sense Publishers, Rotterdam, pp. 199-237.

      [14] Hina Anwar, Usman Qamar, and AbdulWahab Muzaffar Qureshi, “Global Optimization Ensemble Model for Classification Methods”, Hindawi Publishing Corp. Sci.c Worl. J. Vol 2014, Article ID 313164, pp. 1-9.

      [15] Hoe, A.C.K., Ahmad, M.S., Hooi, T.C., Shanmugam, M., Gunasekaran, S.S., Cob, Z.C. and Ramasamy, A. (2014), “Analyzing Students’ Records to Identify Patterns of Students’ Performance”, Res. and Inno. in Info. Sys. (ICRIIS), 2013 Int. Conf. on IEEE, Kuala Lumpur, pp. 544-547.

      [16] Ikbal, S., Tamhane, A., Sengupta, B., Chetlur, M., Ghosh, S. and Appleton, J. (2015), “On early prediction of risks in academic performance for students”, IBM J. of Res. and Develop.t, Vol. 59. No. 6, pp. 1-5.

      [17] Kotsiantis, S.B. and Pintelas, P.E. (2005), “Pred. Students’ Marks in Hellenic Open University”, Adv. Learn. Technol.s, ICALT 2005, 5th IEEE Int. Conf. on IEEE, Washington, DC, July 5-8, pp. 664-668.

      [18] M. S. Halawa, M. E. Shehab and E. M. R. Hamed, "Predicting student personality based on a data-driven model from student behavior on LMS and social networks," 2015 Fifth International Conference on Digital Information Processing and Communications (ICDIPC), in IEEE, Sierre, 2015, pp. 294-299. doi: 10.1109/ICDIPC.2015.7323044

      [19] Md. Rajib Hasan, Fadzilah Siraj, and Mohd Shamrie Sainin, “Improving ensemble decision tree performance using Adaboost and Bagging” AIP Conf. Proc.s 1691, 030008 (2015). https://doi.org/10.1063/1.4937027

      [20] Minaei-Bidgoli, B., Kashy, D.A., Kortemeyer, G. and Punch, W.F. (2003), “Predicting Student Performance: An Application of Data Mining Methods with an Educational Web-Based System”, IEEE, Front.s in Edu. FIE 2003 33rd Annual, Westminster, CO. Vol. 1, pp. 1-13.

      [21] Oladokun, V.O., Adebanjo, A.T. and Charles-Owaba, O.E. (2008), “Predicting Students’ Academic Performance Using Artificial Neural Network: A Case Study of an Engineering Course”, The Pacific J. of Sci. and Technol., Vol. 9. No. 1, pp. 72-79.

      [22] Prerna Kapoor1, Reena Rani, “Efficient Decision Tree Algorithm Using J48 and Reduced Error Pruning” Int. J. of Engg. Res. and Gen. Sci. Volume 3, Issue 3, May-June, 2015, pp. 1613-1621.

      [23] Rokach, L. “Ensemble-based classifiers,” Artif. Intelli. Rev., Vol. 33, pp. 1-39, 2010.

      [24] Romero, C., López, M.I., Luna, J.M. and Ventura, S. (2013), “Predicting Students’ Final Performance from Participation in On-Line Discussion Forums”, Comput.s & Edu.n, Vol. 68, pp. 458-472.

      [25] Sarker, F., Tiropanis, T. and Davis, H.C. (2013), “Exploring Student Predictive Model That Relies On Institutional Databases and Open Data Instead of Traditional Questionnaires”, Proc.s of the 22nd Int. Conf. on WWW. ACM, pp. 413-418.

      [26] Sembiring, S., Zarlis, M., Hartama, D., Ramliana, S. and Wani, E. (2011), “Prediction of Student Academic Performance by an Application of Data Mining Techniques”, Inte. Conf. on Manag.t and Artif.l Intelli. IPEDR, Vol. 6, pp. 110-114.

      [27] Swamy, M.N. and Hanumanthappa, M. (2012), “Predicting Academic Success from Student Enrolment Data Using Decision Tree Technique”, Int. J. of App. Info. Sys., Vol. 4. No. 3, pp. 1-6.

      [28] Trstenjak, B. and Donko, D. (2014), “Determining The Impact of Demographic Features in Predicting Student Success in Croatia”, 37th Int. Conv.n on Info. and Comm.n Tech., Electronics and Microelectronics (MIPRO), IEEE, pp. 1222-1227.

      [29] Wang, X. “Modeling Entrance into STEM Fields of Study among Students Beginning at Beginning at Community Colleges and Four-Year Institutions,” Res. in High. Edu.n, 54 (6), 664-669, September 2013.

      [30] Yaswanth Kumar Alapati, “Combining Clustering with Classification: A Technique to Improve Classification Accuracy” Int. J. of Comput. Sci. Engg. (IJCSE), Vol. 5 No.06 Nov 2016, pp. 336- 338.

      [31] Zhi-Hua Zhou and Yuan Jiang, “NeC4.5: Neural Ensemble Based C4.5”, IEEE Trans. on Knowl. and Data Engg., VOL. 16, NO. 6, Jun 2004, pp. 770-773.




Article ID: 23542
DOI: 10.14419/ijet.v7i4.23542

Copyright © 2012-2015 Science Publishing Corporation Inc. All rights reserved.