C3IMD : An Efficient Class-Based Clustering Classifier for Im-putation Intelligent Medical Data

  • Authors

    • P. Premalatha Nehru Group of Institutions
    • S Subasree Hindustan Group of Institutions
    • N K Sakthivel Nehru Group of Institutions
    2018-08-23
    https://doi.org/10.14419/ijet.v7i2.27.12717
  • Classification, Clustering, Hybrid classifier, Imputation, Medical Data, SVM, DELM.
  • The fast evolution in medical application yields to abundance of huge amount of data in volume and velocity.  Due to this heterogeneous medical data generation from clinical trials, its typically not free from missing values.  Previously introduced imputation techniques don’t discourse the high spatiality problems and application of distance function that even have curse on high spatiality problem. Thus, there’s a necessity an Efficient and Accurate technique to overcome this problem in Medical Data Analysis. To address the above mentioned issues, this research work proposed an efficient Class-Based Clustering Classifier for Imputation Intelligent Medical Data (C3IMD).  This work was implemented in Bio Weka and studied thoroughly. To improve the classification and prediction accuracy, missing data in Medical Data Sets were filled efficiently with the help of proposed Cluster-Classifier Model. The experiments are repeated with various datasets and results are evaluated and compared with existing classifiers WPT-DELM and SVM-DELM. From the results obtained, it was revealed that the proposed Class-Based Clustering Classifier for Imputation Intelligent Medical Data (C3IMD) is outperforming both the existing models in terms of Classification Accuracy, Sensitivity, Specificity and FScore.

     

     
  • References

    1. [1] Ali Kalantari and et. al, “Computational intelligence approaches for classification of medical data: State-of-the-art, future challenges and research directions,†International Journal of Neuro Computing, Pp. 1- 21, (2017).

      [2] UshaRani Yelipe, Sammulal Porika, Madhu Golla, “An efficient approach for imputation and classification of medical data values using class-based clustering of medical records,†International Journal Computers and Electrical Engineering, Pp. 1-18, (2017).

      [3] Zhang S , Qin Z , Ling C , Sheng S, “Missing is useful: missing values in cost-sensitive decision trees,†IEEE Transaction on Knowledge Data Engineering, 17(12): Pp. 1689–93, (2005).

      [4] Zhang C , Qin Y , Zhu X , Zhang J , Zhang S, “Clustering-based missing value imputation for data preprocessing,†IEEE International Conference On Industrial Informatics, Pp. 1081–6, (2006).

      [5] Wang L , Fu D , Li Q , Mu Z, “Modelling method with missing values based on clustering and support vector regression,†Journal of Systems Engineering and Electronics, 21(1), Pp.142–7, (2010).

      [6] Kirkpatrick B , Stevens K, “Perfect phylogeny problems with missing values,†IEEE/ACM Transaction on Computational Biology Bioinformatics, 11(5), Pp. 928–41, (2014).

      [7] Choong M K , Charbit M , Yan H, “Autoregressive-model-based missing value estimation for DNA microarray time series data,†. IEEE Transactions on Information Technology in Biomedicine, 13(1), Pp. 131–7, (2009).

      [8] Zhang Z, “Missing data imputation: focusing on single imputation,†. Annals of Translational Medicine, 4(1), (2016).

      [9] D. Boyd , K. Crawford, “Critical questions for big data: provocations for a cultural, technological, and scholarly phenomenon,†Information, Communication & Society, 15 , Pp. 662–679, (2012) .

      [10] X. Wu , X. Zhu , G.-Q. Wu , W. Ding, “Data mining with big data,†IEEE Transactions on Knowledge and Data Engineering, 26, Pp. 97–107, (2014).

      [11] I.A.T. Hashem , I. Yaqoob , N.B. Anuar , S. Mokhtar , A. Gani , S.U. Khan , “The rise of “big data†on cloud computing: review and open research issues,†Journal of Information System, 47, Pp. 98–115, (2015) .

      [12] V. Mayer-Schönberger , K. Cukier, “Big Data: A Revolution That will Transform How We Live, Work, and Think, Houghton Mifflin Harcourt,†2013 .

      [13] A . Gani , A . Siddiqa , S. Shamshirband , F. Hanum , “A survey on indexing techniques for big data: taxonomy and performance evaluation,†Knowledge and Information Systems, 46, Pp.241–284, (2016).

      [14] Lewis HD, “Missing data in clinical trials,†New England Journal of Medicine, Pp. 2557–8, (2012) .

      [15] Luengo, J., García, S., Herrera, F., “A study on the use of imputation methods for experimentation with radial basis function network classifiers handling missing attribute values: the good synergy between RBFs and event covering method,†Neural Networks. 23 406–418,(2009).

      [16] Zhang C , Qin Y , Zhu X , Zhang J , Zhang S., “Clustering-based missing value imputation for data preprocessing,â€IEEE international conference on industrial informatics; Pp. 1081–6, (2006).

      [17] Wang L , Fu D , Li Q , Mu Z, “Modelling method with missing values based on clustering and support vector regression,†Journal of Systems Engineering and Electronics, 21(1), Pp.142–7, (2012).

      [18] Zhu X , Zhang S , Zhi J , Zhang Z , Xu Z, “Missing value estimation for mixed-attribute data sets,†IEEE Transactions on Knowledge and Data Engineering, 23(1), Pp.110–21, 2013.

      [19] Farhangfar A , Kurgan L , Pedrycz, “A novel framework for imputation of missing values in databases,†IEEE transactions on systems, man, and cybernetics, 37(5), Pp.692–709, (2017). .

      [20] Choong MK , Charbit M , Yan H, “Autoregressive-model-based missing value estimation for DNA microarray time series data,†IEEE Transactions on Information Technology in Biomedicine, 13(1), Pp.131–7, (2009). .

      [21] Thirukumaran S, Sumathi A. “Improving accuracy rate of imputation of missing data using classifier methods,†International Conference On Intelligent Systems And Control (ISCO), Coimbatore; Pp.1–7, (2016).

      [22] Razavi-Far R, Saif M, “Imputation of missing data using fuzzy neighborhood density-based clustering,†IEEE international conference on fuzzy systems (FUZZ-IEEE), Pp. 1834–41, (2016).

      [23] Aljuaid T, Sasi S., “Proper imputation techniques for missing values in data sets,â€International Conference On Data Science And Engineering (ICDSE), Pp. 1–5, (2016)

      [24] Song Q , Shepperd M, “A new imputation method for small software project data sets,†Journal of Systems and Software, 80(1), Pp. 51–62, (2006).

      [25] Gira A, “Estimation of population mean with a new imputation method,†Applied Mathematical Sciences, 9(34), Pp.1663–72, 2015.

      [26] Nishanth KJ , et al, “Soft computing based imputation and hybrid data and text mining: the case of predicting the severity of phishing alerts,†Journal of Expert Systems With Applications, 39(12), Pp. 10583–9, (2012).

      [27] Tang F , Ishwaran H, “Random forest missing data algorithms," Journal of Statistical Analysis and Data Mining (2017).

      [28] Petrozziello A , Jordanov I , “Column-wise guided data imputation,â€. Procedia Computer Science, 108, Pp. 2282–6, (2017).

      [29] A . Mellit , S.A . Kalogirou, “Artificial intelligence techniques for photovoltaic ap-plications: a review,†Journal of Progress in Energy and Combustion Science, 34, Pp. 574–632, (2008).

      [30] A.P. Engelbrecht , “Computational Intelligence: An Introduction,†John Wiley & Sons, (2007).

      [31] F. Latifo ˇglu , H. Kodaz , S. Kara , S. Güne, “Medical application of artificial immune recognition system (AIRS): diagnosis of atherosclerosis from carotid artery Doppler signals,†Journal of Computers in Biology and Medicine, 37, Pp.1092–1099, (2007).

      [32] X. Gu , T. Ni , H. Wang, “New fuzzy support vector machine for the class imbalance problem in medical datasets classification,†Scientific World Journal, Pp.1–12, 2014.

  • Downloads

  • How to Cite

    Premalatha, P., Subasree, S., & K Sakthivel, N. (2018). C3IMD : An Efficient Class-Based Clustering Classifier for Im-putation Intelligent Medical Data. International Journal of Engineering & Technology, 7(2.27), 255-260. https://doi.org/10.14419/ijet.v7i2.27.12717