Evolutionary Algorithm Pipeline for Improving in Prediction of ‎Earlier Diagnosis of Breast Cancer

  • Authors

    • G. Sridevi Research Scholar, Department of Computer Science, School of Computing Sciences, Vels Institute of Science, Technology and Advanced ‎Studies (VISTAS), Pallavaram, Chennai, Tamil Nadu, India
    • K. Sharmila Research Supervisor and Associate Professor in the Department of Computer Science, School of Computing Sciences, Vels Institute of Science, ‎Technology and Advanced Studies (VISTAS), Pallavaram, Chennai, Tamil Nadu, India
    https://doi.org/10.14419/qs9zrt46

    Received date: July 15, 2025

    Accepted date: July 24, 2025

    Published date: November 1, 2025

  • Breast Cancer; Artificial Neural Network; Ensemble Learning; Machine Learning; Evolutionary Algorithm
  • Abstract

    The cells in the breast have been developed in terms of cancer named Breast Cancer (BC), and over the past decade, BC has been the most ‎identified type of cancer in women. Currently, the rapid increase in the predominance of BC may lead to increased mortality worldwide. The ‎disorder is diagnosed manually, which requires efficient expertise and time, both of which are significant in detecting cancer earlier and ‎informing subsequent treatments. There are various Machine Learning (ML) methods that assist in making several decisions as well as ‎performing diagnoses from the collected data, but have failed in accomplishing high prediction in diagnosing BC. This paper focuses on ‎investigating the probability of Ensemble Learning (EL) with Evolutionary Algorithm (EA) for risk prediction and BC categorization, ‎precisely through exact feature selection and extraction. Moreover, the key focus is on creating a consistent and precise model by associating ‎the best features of various learning algorithms and ensembling the metadata sets. Therefore, this research focuses on analyzing methods to ‎improve the accuracy of BC diagnosis through a proposed ensemble Artificial Neural Network (ANN) method with a Keras-based ‎approach. Hence, the evaluated results propose a method determined through accuracy metrics, which are compared with the Boruta method ‎and the ANN method. The accuracy of the Keras-based ensemble ANN method, at 97.37%, is higher than that of other ML methods‎.

  • References

    1. Park, M. Y. et al. Function and application of flavonoids in the breast cancer. Int. J. Mol. Sci. 23, 7732 (2022). https://doi.org/10.3390/ijms23147732.
    2. Uddin, K. M. M., Biswas, N., Rikta, S. T. & Dey, S. K. Machine learning-based diagnosis of breast cancer utilizing feature optimization technique. Comput. Methods Progr. Biomed. Update 3, 100098 (2023). https://doi.org/10.1016/j.cmpbup.2023.100098.
    3. A. Sharma and P. K. Mishra, “Performance analysis of machine learning based optimized feature selection approaches for breast cancer diagnosis,” International Journal of Information Technology (Singapore), vol. 14, no. 4, pp. 1949–1960, Jun. 2022, https://doi.org/10.1007/s41870-021-00671-5.
    4. S. S. Yadav and S. M. Jadhav, “Thermal infrared imaging based breast cancer diagnosis using machine learning techniques,” Multimed Tools Appl, vol. 81, no. 10, pp. 13139–13157, Apr. 2022, https://doi.org/10.1007/s11042-020-09600-3.
    5. A. Alzu’bi, H. Najadat, W. Doulat, O. Al-Shari, and L. Zhou, “Predicting the recurrence of breast cancer using machine learning algorithms,” Mul-timed Tools Appl, vol. 80, no. 9, pp. 13787–13800, Apr. 2021, https://doi.org/10.1007/s11042-020-10448-w.
    6. S. Rani, M. Kaur, and M. Kumar, “Recommender system: prediction/diagnosis of breast cancer using hybrid machine learning algorithm,” Mul-timed Tools Appl, vol. 81, no. 7, pp. 9939–9948, Mar. 2022, https://doi.org/10.1007/s11042-022-12144-3.
    7. Afolayan, J.O.; Adebiyi, M.O.; Arowolo, M.O.; Chakraborty, C.; Adebiyi, A.A. Breast cancer detection using particle swarm optimization and de-cision tree machine learning technique. In Intelligent Healthcare; Chakraborty, C., Khosravi, M.R., Eds.; Springer Nature: Singapore, 2022; pp. 61–83. https://doi.org/10.1007/978-981-16-8150-9_4.
    8. Guida, F.; Kidman, R.; Ferlay, J.; Schüz, J.; Soerjomataram, I.; Kithaka, B.; Ginsburg, O.; Vega, R.B.M.; Galukande, M.; Parham, G.;et al. Global and regional estimates of orphans attributed to maternal cancer mortality in 2020. Nat. Med. 2022, 28, 2563–2572. https://doi.org/10.1038/s41591-022-02109-2.
    9. W.Wang, R. Jiang, N. Cui, Q. Li, F. Yuan, and Z. Xiao, ‘‘Semi-supervised vision transformer with adaptive token sampling for breast cancer classi-fication,’’ Frontiers Pharmacol., vol. 13, Jul. 2022, Art. no. 929755. https://doi.org/10.3389/fphar.2022.929755.
    10. Z.-R. Jiang, L.-H. Yang, L.-Z. Jin, L.-M. Yi, P.-P. Bing, J. Zhou, and J.-S. Yang, ‘‘Identification of novel cuproptosis-related lncRNA signatures to predict the prognosis and immune microenvironment of breast cancer patients,’’ Frontiers Oncol., vol. 12, Sep. 2022, Art. no. 988680. https://doi.org/10.3389/fonc.2022.988680.
    11. M. Umer, M. Naveed, F. Alrowais, A. Ishaq, A. A. Hejaili, S. Alsubai, A. A. Eshmawi, A. Mohamed, and I. Ashraf, ‘‘Breast cancer detection us-ing convoluted features and ensemble machine learning algorithm,’’Cancers, vol. 14, no. 23, p. 6015, Dec. 2022. https://doi.org/10.3390/cancers14236015.
    12. B. He, Y. Zhang, Z. Zhou, B. Wang, Y. Liang, J. Lang, H. Lin, P. Bing, L. Yu, D. Sun, H. Luo, J. Yang, and G. Tian, ‘‘A neural network frame-work for predicting the tissue-of-origin of 15 common cancer types based on RNA-Seq data,’’ Frontiers Bioeng. Biotechnol., vol. 8, p. 737, Aug. 2020. https://doi.org/10.3389/fbioe.2020.00737.
    13. Chaurasia, V. & Pal, S. Applications of machine learning techniques to predict diagnostic breast cancer. SN Comput. Sci. 1, 1–11(2020). https://doi.org/10.1007/s42979-020-00296-8.
    14. Kabiraj, S. et al. Breast cancer risk prediction using XGBoost and random forest algorithm. In 2020 11th International Conference on Computing, Communication and Networking Technologies, ICCCNT 2020. https://doi.org/10.1109/ICCCNT49239.2020.9225451.
    15. Jabbar, M. A. Breast cancer data classification using ensemble machine learning. Eng. Appl. Sci. Res. 48, 65–72 (2021).
    16. Shalini, M. & Radhika, S. Machine learning techniques for prediction from various breast cancer datasets. In 2020 6th International Conference on Bio Signals, Images, and Instrumentation, ICBSII 2020.
    17. Naji, M. A. et al. Machine learning algorithms for breast cancer prediction and diagnosis. Procedia Comput. Sci. 191, 487–492 (2021). https://doi.org/10.1016/j.procs.2021.07.062.
    18. Benbrahim, H.; Hachimi, H.; Amine, A. Comparative study of machine learning algorithms using the breast cancer dataset. In Proceedings of the Advanced Intelligent Systems for Sustainable Development (AI2SD’2019), Marrakech, Morocco, 8–11 July 2019; Springer: Berlin/Heidelberg, Germany, 2020; pp. 83–91. https://doi.org/10.1007/978-3-030-36664-3_10.
    19. Ghiasi, M.M.; Zendehboudi, S. Application of decision tree-based ensemble learning in the classification of breast cancer. Comput. Biol. Med. 2021, 128, 104089. https://doi.org/10.1016/j.compbiomed.2020.104089.
    20. Alshayeji, M.H.; Ellethy, H.; Abed, S.; Gupta, R. Computer-aided detection of breast cancer on the Wisconsin dataset: An artificial neural net-works approach. Biomed. Signal Process. Control 2022, 71, 103141. https://doi.org/10.1016/j.bspc.2021.103141.
    21. Kwang Ho Park, ErdenebilegBatbaatar, YongjunPiao, NiponTheera-Umpon, Keun Ho Ryu, Deep learning feature extraction approach for hemato-poietic cancer subtype classification, Int. J. Environ. Res. Public Health 18 (4) (2021) 2197. https://doi.org/10.3390/ijerph18042197.
    22. Eun Young Park, Myungsun Yi, HyeSook Kim, Haejin Kim, A decision tree model for breast reconstruction of women with breast cancer: a mixed method approach, Int. J. Environ. Res. Public Health 18 (7) (2021) 3579. https://doi.org/10.3390/ijerph18073579.
    23. ZibaKhandezamin, MarjanNaderan, Mohammad JavadRashti, Detection and classification of breast cancer using logistic regression feature selec-tion and GMDH classifier, J. Biomed. Inform. 111 (2020) 103591. https://doi.org/10.1016/j.jbi.2020.103591.
    24. AbdurRasool, ChayutBunterngchit, Luo Tiejian, MdRuhul Islam, QiangQu, Qingshan Jiang, Improved machine learning-based predictive models for breast cancer diagnosis, Int. J. Environ. Res. Public Health 19 (6) (2022) 3211. https://doi.org/10.3390/ijerph19063211.
    25. Gary Pampar_a and Andries P Engelbrecht. Evolutionary and swarm-intelligence algorithms through monadic composition. In Proceedings of the Genetic and Evolutionary Computation Conference Companion, pages 1382{1390, 2019. https://doi.org/10.1145/3319619.3326845.
    26. Mirabi, M., & Lahti, E. (2025). Graph signal processing-based interference suppression for multi-user massive MIMO systems. Electronics, Com-munications, and Computing Summit, 3(2), 56–61.
    27. Veerappan, S. (2025). Digital Twin Modeling for Predictive Maintenance in Large-Scale Power Transformers. National Journal of Electrical Ma-chines & Power Conversion, 39-44.
    28. Abdullah, D. (2024). Design and implementation of secure VLSI architectures for cryptographic applications. Journal of Integrated VLSI, Embed-ded and Computing Technologies, 1(1), 21-25.
    29. Prasath, C. A. (2023). The role of mobility models in MANET routing protocols efficiency. National Journal of RF Engineering and Wireless Communication, 1(1), 39-48.
    30. Sadulla, S. (2025). IoT-enabled smart buildings: A sustainable approach for energy management. National Journal of Electrical Electronics and Au-tomation Technologies, 1(1), 14–23.
    31. Rahim, R. (2025). Lightweight speaker identification framework using deep embeddings for real-time voice biometrics. National Journal of Speech and Audio Processing, 1(1), 15–21.
    32. Surendar, A. (2025). Lightweight CNN architecture for real-time image super-resolution in edge devices. National Journal of Signal and Image Pro-cessing, 1(1), 1–8.
    33. Abbas, M. A., Hatem, T. M., Tolba, M. A., & Atia, M. (2023). Physical Design of Speed Improvised Factor in FPGA Applications. Journal of VLSI Circuits and Systems, 5(1), 61–66. https://doi.org/10.31838/jvcs/05.01.09.
    34. Marwedel, R., Jacobson, U., & Dobrigkeit, K. (2025). Embedded systems for real-time traffic management: Design, implementation, and challeng-es. SCCTS Journal of Embedded Systems Design and Applications, 2(1), 43–56.
  • Downloads

  • How to Cite

    Sridevi, G. ., & Sharmila , K. (2025). Evolutionary Algorithm Pipeline for Improving in Prediction of ‎Earlier Diagnosis of Breast Cancer. International Journal of Basic and Applied Sciences, 14(SI-1), 624-631. https://doi.org/10.14419/qs9zrt46