Comparative study on dimensionality reduction for disease diagnosis using fuzzy classifier

 
 
 
  • Abstract
  • Keywords
  • References
  • PDF
  • Abstract


    Machine learning is the worldwide recent research technique for various systems as they are intelligent enough to find the solution for classification and prediction problems. The proposed work is about a hybrid genetic fuzzy algorithm that performs an optimal search as well as classification upon uncertain data. The data which is uncertain is suitable for fuzzy classifiers to predict the disease. The hybrid genetic fuzzy system applied on the attributes selects relevant attributes. The selected attributes are fed into the fuzzy classifier. The fuzzy rules are again generated using genetic algorithms. This algorithm is applied on three of the important and bench marking data sets taken from the UCI machine learning repository. The heart disease, Wisconsin breast cancer and Pima Indian diabetes datasets produce classification accuracy as 89.65%, 99.5% and 88.93% respectively. In this article there is a comparative study on few of the feature selection and feature reduction techniques.


  • Keywords


    Feature Selection; Feature Extraction; Genetic Algorithms; Disease Diagnosis; Fuzzy Classifier.

  • References


      [1] Jingfeng, C, Medicine in China. Encyclopedia of the History of Science, Technology, and Medicine in Non-Western Cultures ,(2008), 1529–1534

      [2] Thompson, Carl, and Dawn Dowding. Essential Decision Making and Clinical Judgement for Nurses E-Book. Elsevier Health Sciences, 2009.

      [3] Cios, Krzysztof J., et al. Data mining: a knowledge discovery approach. Springer Science & Business Media, 2007.

      [4] Ahmad, Fadzil, et al., Intelligent breast cancer diagnosis using hybrid GA-ANN, Computational Intelligence, Communication Systems and Networks (CICSyN), 2013 Fifth International Conference on. IEEE, 2013.

      [5] Jaganathan, P., and R. Kuppuchamy, A threshold fuzzy entropy based feature selection for medical database classification, Computers in Biology and Medicine 43, 12, (2013), 2222-2229.

      [6] Wang, Peng, Cesar Sanin, and Edward Szczerbicki, Evolutionary algorithm and decisional DNA for multiple travelling salesman problem, Neurocomputing, 150, (2015), 50-57. https://doi.org/10.1016/j.neucom.2014.01.075.

      [7] Pham, Dinh Thanh, and Thi Thanh Binh Huynh, An Effective Combination of Genetic Algorithms and the Variable Neighborhood Search for Solving Travelling Salesman Problem, Technologies and Applications of Artificial Intelligence (TAAI), 2015 Conference on. IEEE, 2015.

      [8] Shen, Zhonghua, Keith J. Burnham, and Leonid Smalov, Optimised job-shop scheduling via genetic algorithm for a manufacturing production system, Progress in Systems Engineering. Springer, Cham, (2015), 89-92.

      [9] Sivanandam, S. N., and S. N. Deepa. Introduction to genetic algorithms. Springer Science & Business Media, 2007.

      [10] Veenstra, Michelle Anne, et al, Raman spectroscopy in the diagnosis of ulcerative colitis, European Journal of Pediatric Surgery 25, 01 (2015), 56-59.

      [11] Hariharan, Muthusamy, Kemal Polat, and Ravindran Sindhu, A new hybrid intelligent system for accurate detection of Parkinson's disease, Computer methods and programs in biomedicine, 113,3, (2014), 904-913.

      [12] Giri, Donna, et al., Automated diagnosis of coronary artery disease affected patients using LDA, PCA, ICA and discrete wavelet transform, Knowledge-Based Systems, 37, (2013), 274-282. https://doi.org/10.1016/j.knosys.2012.08.011.

      [13] Çalişir, Duygu, and Esin Doğantekin, An automatic diabetes diagnosis system based on LDA-Wavelet Support Vector Machine Classifier, Expert Systems with Applications, 38,7, (2011), 8311-8315.

      [14] Alavala, Chennakesava R. Fuzzy logic and neural networks: basic concepts & application. New Age International, 2008.

      [15] Santhi, D., D. Manimegalai, and S. Karkuzhali. Diagnosis of diabetic retinopathy by exudates detection using clustering techniques, Biomedical Engineering: Applications, Basis and Communications, 26, 06, (2014), 1450077.

      [16] Assadi, Ava, and Saman Harati Zade, UGA: A new genetic algorithm-based classification method for uncertain data, Mid-Est J Scient Res 20.10, (2014), 1207-1212.

      [17] Shamshirband, Shahaboddin, et al, Tuberculosis disease diagnosis using artificial immune recognition system, International journal of medical sciences 11, 5 (2014), 508.

      [18] Zhang, Xiaofan, et al., towards large-scale histopathological image analysis: Hashing-based image retrieval, IEEE Transactions on Medical Imaging, 34, 2, (2015), 496-506.

      [19] Papakostas, George A., et al., A lattice computing approach to Alzheimer’s disease computer assisted diagnosis based on MRI data, Neurocomputing 150, (2015), 37-42. https://doi.org/10.1016/j.neucom.2014.02.076.

      [20] Balachandran, K., and R. Anitha, Dimensionality reduction based on the classifier models: Performance Issues in the prediction of Lung cancer, Software Engineering (CONSEG), 2012 CSI Sixth International Conference on. IEEE, 2012.

      [21] Ramani, R. Geetha, and Shomona Gracia Jacob, Improved classification of lung cancer tumors based on structural and physicochemical properties of proteins using data mining models, PloS one 8,3, (2013), e58772.

      [22] Sun, Shuping. An innovative intelligent system based on automatic diagnostic feature extraction for diagnosing heart diseases, Knowledge-Based Systems 75, (2015), 224-238. https://doi.org/10.1016/j.knosys.2014.12.001.

      [23] Alickovic, Emina, and Abdulhamit Subasi, Effect of multiscale PCA de-noising in ECG beat classification for diagnosis of cardiovascular diseases, Circuits, Systems, and Signal Processing 34, 2, (2015), 513-533.

      [24] Sujatha .R Ezhilmaran. Performance analysis of data mining classification techniques for chronic kidney disease. International Journal of Pharmacy and Technology, (2016), 8, 2, 13032-13037.

      [25] Zhi, Koh Yi, Oliver Faust, and Wenwei Yu, Wavelet based machine learning techniques for electrocardiogram signal analysis, Journal of Medical Imaging and Health Informatics, 4, 5, (2014), 737-742.

      [26] Vafaie, M. H., M. Ataei, and Hamid R. Koofigar, Heart diseases prediction based on ECG signals’ classification using a genetic-fuzzy system and dynamical model of ECG signals, Biomedical Signal Processing and Control 14, (2014), 291-296. https://doi.org/10.1016/j.bspc.2014.08.010.

      [27] I.Guyon, J.Weston, S.Barnhill, V.Vapnik, Gene Selection for cancer classification using support vector machine, Machine Learning, 2002, 389-422. https://doi.org/10.1023/A:1012487302797.

      [28] Han, Jiawei, Jian Pei, and Micheline Kamber. Data mining: concepts and techniques. Elsevier, 2011.


 

View

Download

Article ID: 8652
 
DOI: 10.14419/ijet.v7i1.8652




Copyright © 2012-2015 Science Publishing Corporation Inc. All rights reserved.