A machine learning based approach to classify autism with optimum behavior sets


  • Vaishali R VIT University
  • Sasikala R VIT University






Autism Spectrum Disorder, Behavioral Analytics, Machine Learning, Feature.


Machine Learning based behavioural analytics emphasis the need to develop accurate prediction models for detecting the risk of autism faster than the traditional diagnostic methods. Quality of prediction rely on the accuracy of the supplied dataset and the machine learning model.To improve accuracy of prediction, dimensionality reduction with feature selection is applied to eliminate noisy features from a dataset. In this work an ASD diagnosis dataset with 21 features obtained from UCI machine learning repository is experimented with swarm intelligence based binay firefly feature selection wrapper. The alternative hypothesis of the experiment claims that it is possible for a machine learning model to achieve a better classification accuracy with minimum feature subsets.Using Swarm intelligence based single-objective binary firefly feature selection wrapper it is found that 10 features among 21 features of ASD dataset are sufficient to distinguish between ASD and non-ASD patients.The results obtained with our approach justifies the hypothesis by producing an average accuracy in the range of 92.12%-97.95% with optimum feature subsets which is approximately equal to the average accuracy produced by entire ASD diagnosis dataset.





[1] J. Baio, “Prevalence of Autism Spectrum Disorder Among Children Aged 8 Years—Autism and Developmental Disabilities Monitoring Network, 11 Sites, United States, 2014,†MMWR. Surveill. Summ. vol. 67, 2018.

[2] A. P. Association and others, Diagnostic and statistical manual of mental disorders (DSM-5®). American Psychiatric Pub, 2013. https://doi.org/10.1176/appi.books.9780890425596.

[3] F. Thabtah, “Autism Spectrum Disorder Screening: Machine Learning Adaptation and DSM-5 Fulfillment,†in Proceedings of the first International Conference on Medical and Health Informatics 2017, 2017, pp. 1–6. https://doi.org/10.1145/3107514.3107515.

[4] M. Duda, R. Ma, N. Haber, and D. P. Wall, “Use of machine learning for behavioral distinction of autism and ADHD,†Transl. Psychiatry, vol. 6, no. 2, p. e732, 2017. https://doi.org/10.1038/tp.2015.221.

[5] J. A. Kosmicki, V. Sochat, M. Duda, and D. P. Wall, “Searching for a minimal set of behaviors for autism detection through feature selection-based machine learning,†Transl. Psychiatry, vol. 5, no. 2, p. e514, 2015. https://doi.org/10.1038/tp.2015.7.

[6] F. Thabtah, “Machine learning in autistic spectrum disorder behavioral research: A review and ways forward,†Informatics Heal. Soc. Care, vol. 0, no. 0, pp. 1–20, 2018. https://doi.org/10.1080/17538157.2017.1399132.

[7] R. C. Team and others, “R: A language and environment for statistical computing,†2013.

[8] G. Holmes, A. Donkin, and I. H. Witten, “Weka: A machine learning workbench,†in Intelligent Information Systems, 1994. Proceedings of the 1994 Second Australian and New Zealand Conference on, 1994, pp. 357–361. https://doi.org/10.1109/ANZIIS.1994.396988.

[9] W. Siedlecki and J. Sklansky, “On automatic feature selection,†Int. J. Pattern Recognit. Artif. Intel. vol. 2, no. 02, pp. 197–220, 1988. https://doi.org/10.1142/S0218001488000145.

[10] L. Tolosi and T. Lengauer, “Classification with correlated features: unreliability of feature ranking and solutions,†Bioinformatics, vol. 27, no. 14, pp. 1986–1994, 2011. https://doi.org/10.1093/bioinformatics/btr300.

[11] X. Wang, J. Yang, X. Teng, W. Xia, and R. Jensen, “Feature selection based on rough sets and particle swarm optimization,†Pattern Recognit. Lett. vol. 28, no. 4, pp. 459–471, 2007. https://doi.org/10.1016/j.patrec.2006.09.003.

[12] A. Unler, A. Murat, and R. B. Chinnam, “mr2PSO: A maximum relevance minimum redundancy feature selection method based on swarm intelligence for support vector machine classification,†Inf. Sci. (Ny). vol. 181, no. 20, pp. 4625–4641, 2011. https://doi.org/10.1016/j.ins.2010.05.037.

[13] H. Banati and M. Bajaj, “Fire fly based feature selection approach,†IJCSI Int. J. Comput. Sci. Issues, vol. 8, no. 4, 2011.

[14] G. H. John and P. Langley, “Estimating continuous distributions in Bayesian classifiers,†in Proceedings of the Eleventh conference on Uncertainty in artificial intelligence, 1995, pp. 338–345.

[15] J. R. Quinlan, “C4. 5: Programming for machine learning,†Morgan Kauffmann, vol. 38, p. 48, 1993.

[16] S. S. Keerthi, S. K. Shevade, C. Bhattacharyya, and K. R. K. Murthy, “Improvements to Platt’s SMO algorithm for SVM classifier design,†Neural Comput., vol. 13, no. 3, pp. 637–649, 2001. https://doi.org/10.1162/089976601300014493.

[17] J. C. Platt, “12 fast training of support vector machines using sequential minimal optimization,†Adv. kernel methods, pp. 185–208, 1999.

[18] T. Hastie and R. Tibshirani, “Classification by pairwise coupling,†in Advances in neural information processing systems, 1998, pp. 507–513.

[19] D. W. Aha, D. Kibler, and M. K. Albert, “Instance-based learning algorithms,†Mach. Learn., vol. 6, no. 1, pp. 37–66, 1991. https://doi.org/10.1007/BF00153759.

[20] S. K. Pal and S. Mitra, “Multilayer perceptron, fuzzy sets, and classification,†IEEE Trans. neural networks, vol. 3, no. 5, pp. 683–697, 1992. https://doi.org/10.1109/72.159058.

[21] X.-S. Yang, “Firefly algorithm, Levy flights and global optimization,†in Research and development in intelligent systems XXVI, Springer, 2010, pp. 209–218. https://doi.org/10.1007/978-1-84882-983-1_15.

[22] H. Banati and M. Bajaj, “Fire fly based feature selection approach,†IJCSI Int. J. Comput. Sci. Issues, vol. 8, no. 4, 2011.

[23] L. Zhang, K. Mistry, C. P. Lim, and S. C. Neoh, “Feature selection using firefly optimization for classification and regression models,†Decis. Support Syst., 2017.

View Full Article: