Software Concept and Knowledge Extraction for Data Mining In ‎Healthcare Sectors

  • Authors

    https://doi.org/10.14419/472vp602

    Received date: October 13, 2025

    Accepted date: January 2, 2026

    Published date: January 26, 2026

  • Software; Data Mining; Knowledge Management; Information Technology; Health Care
  • Abstract

    A technique called data mining makes it easy to extract valuable insights from large databases. It is the procedure of acquiring knowledge ‎from information. This technical capability has generated much interest in society recently, especially in the information sector. Focusing on ‎the healthcare industry, which has significant implementation issues, this article emphasizes the need for software in data mining by offering ‎important backing for the development of effective and maintainable systems. Designing dependable systems, specifying both functional ‎and non-functional needs, creating efficient algorithms, guaranteeing sustainable development, improving performance, and facilitating inte-‎gration are important elements of software pertinent to this field. The study underlines that software underpins the execution of good and ‎efficient data mining techniques. Additionally, this research will examine uses of data mining in healthcare information as well as related ‎problems. This descriptive study examines the ideas, problems, and techniques applied in data mining, particularly in the healthcare sector‎.

  • References

    1. S. Khan OKESOLA and M. Shaheen, "From data mining to wisdom mining," Journal of Information Science, vol. 49, no. 4, pp. 952-975, 2023. https://doi.org/10.1177/01655515211030872.
    2. R. Koppel, "Estimating the United States’ cost of healthcare information technology," in Healthcare Information Management Systems: Cases, Strategies, and Solutions: Springer, 2022, pp. 3-38. https://doi.org/10.1007/978-3-031-07912-2_1.
    3. E. P. Ambinder, "Electronic Health Records," Journal of Oncology Practice, vol. 1, no. 2, pp. 57-63, 2005, https://doi.org/10.1200/jop.2005.1.2.57.
    4. A. Nambiar and D. Mundra, "An overview of data warehouse and data lake in modern enterprise data management," Big data and cognitive com-puting, vol. 6, no. 4, p. 132, 2022. https://doi.org/10.3390/bdcc6040132.
    5. D. Shukla, S. B. Patel, and A. K. Sen, "A literature review in health informatics using data mining techniques," International Journal of Software and Hardware Research in Engineering, vol. 2, no. 2, pp. 123-129, 2014.
    6. A. Dogan and D. Birant, "Machine learning and data mining in manufacturing," Expert Systems with Applications, vol. 166, p. 114060, 2021/03/15/ 2021, https://doi.org/10.1016/j.eswa.2020.114060.
    7. W. I. D. Mining, Introduction to data mining. Springer, 2006.
    8. A. A. Hasan and H. Fang, "Data Mining in Education: Discussing Knowledge Discovery in Database (KDD) with Cluster Associative Study," pre-sented at the 2021 2nd International Conference on Artificial Intelligence and Information Systems, Chongqing, China, 2021. [Online]. Available: https://doi.org/10.1145/3469213.3471319.
    9. A. Thakkar and R. Lohiya, "A survey on intrusion detection system: feature selection, model, performance measures, application perspective, chal-lenges, and future research directions," Artificial Intelligence Review, vol. 55, no. 1, pp. 453-563, 2022. https://doi.org/10.1007/s10462-021-10037-9.
    10. K. Maharana, S. Mondal, and B. Nemade, "A review: Data pre-processing and data augmentation techniques," Global Transitions Proceedings, vol. 3, no. 1, pp. 91-99, 2022. https://doi.org/10.1016/j.gltp.2022.04.020.
    11. Y. Yao, "Symbols-Meaning-Value (SMV) space as a basis for a conceptual model of data science," International Journal of Approximate Reasoning, vol. 144, pp. 113-128, 2022. https://doi.org/10.1016/j.ijar.2022.02.001.
    12. S. Chakraborty, S. H. Islam, and D. Samanta, Data classification and incremental clustering in data mining and machine learning. Springer, 2022. https://doi.org/10.1007/978-3-030-93088-2.
    13. O. O., "DATA MINING METHODOLOGY AND ITS APPLICATION," 2021.
    14. D. K. Nayak, A. K. Mishra, and M. Mistry, "A BRIEF STUDY ON DATA MINING AND BIG DATA."
    15. R. R. Asaad and R. M. Abdulhakim, "The Concept of Data Mining and Knowledge Extraction Techniques," Qubahan Academic Journal, vol. 1, no. 2, pp. 17-20, 2021. https://doi.org/10.48161/qaj.v1n2a43.
    16. X. Shu and Y. Ye, "Knowledge Discovery: Methods from data mining and machine learning," Social Science Research, vol. 110, p. 102817, 2023. https://doi.org/10.1016/j.ssresearch.2022.102817.
    17. I. H. Sarker, "Machine learning: Algorithms, real-world applications and research directions," SN computer science, vol. 2, no. 3, p. 160, 2021. https://doi.org/10.1007/s42979-021-00592-x.
    18. A. Y. Abd Alazeez, "Data mining between classical and modern applications: A review," AL-Rafidain Journal of Computer Sciences and Mathe-matics, vol. 15, no. 2, pp. 171-191, 2021. https://doi.org/10.33899/csmj.2021.170020.
    19. A. Saini, "Data Mining Architecture–Data Mining Types and Techniques," 2021.
    20. C. McInerney, "Knowledge management and the dynamic nature of knowledge," Journal of the American society for Information Science and Technology, vol. 53, no. 12, pp. 1009-1018, 2002. https://doi.org/10.1002/asi.10109.
    21. D. Harman, "Knowledge Management (KM) Processes in Organizations: Theoretical Foundations and Practice/Harman D," ed: New York, NY: Basic Books, 2011.
    22. A. S. Wahjudewanti, J. H. Tjakraatmaja, and Y. Anggoro, "Knowledge Management Strategies to Improve Learning and Growth in Creative In-dustries: A Framework Model," Budapest International Research and Critics Institute-Journal (BIRCI-Journal) Vol, vol. 4, no. 2, pp. 1903-1915, 2021. https://doi.org/10.33258/birci.v4i2.1876.
    23. Y. Li, M. Kramer, A. J. Beulens, and J. G. van der Vorst, "A framework for early warning and proactive control systems in food supply chain net-works," Computers in Industry, vol. 61, no. 9, pp. 852-862, 2010. https://doi.org/10.1016/j.compind.2010.07.010.
    24. T. Silwattananusarn and K. Tuamsuk, "Data mining and its applications for knowledge management: a literature review from 2007 to 2012," arXiv preprint arXiv:1210.2872, 2012. https://doi.org/10.5121/ijdkp.2012.2502.
    25. S. V. G. Subrahmanya et al., "The role of data science in healthcare advancements: applications, benefits, and future prospects," Irish Journal of Medical Science (1971-), vol. 191, no. 4, pp. 1473-1483, 2022. https://doi.org/10.1007/s11845-021-02730-z.
    26. A. H. Al-Obaidi, "Data Mining In Healthcare Sectors," Wisdom Journal for Studies & Research, vol. 4, no. 06, pp. 1215-1237, 2024. https://doi.org/10.55165/wjfsar.v4i06.502.
    27. A. H. T. Al-Ghrairi, A. A. Mohammed, and H. M. Saeed, "An Application of Web-based E-Healthcare Management System Using ASP. Net," Webology, vol. 18, no. 1, 2021. https://doi.org/10.14704/WEB/V18I1/WEB18089.
    28. A. M. Mosadeghrad, "Factors influencing healthcare service quality," International journal of health policy and management, vol. 3, no. 2, p. 77, 2014. https://doi.org/10.15171/ijhpm.2014.65.
    29. S. A. Alowais et al., "Revolutionizing healthcare: the role of artificial intelligence in clinical practice," BMC medical education, vol. 23, no. 1, p. 689, 2023. https://doi.org/10.1186/s12909-023-04698-z.
    30. M. Javaid, A. Haleem, and R. P. Singh, "Health informatics to enhance the healthcare industry's culture: An extensive analysis of its features, con-tributions, applications and limitations," Informatics and Health, 2024. https://doi.org/10.1016/j.infoh.2024.05.001.
    31. H. Chang, "Recent movement on education and training in health informatics," Healthcare Informatics Research, vol. 20, no. 2, p. 79, 2014. https://doi.org/10.4258/hir.2014.20.2.79.
    32. A. Dutt, M. A. Ismail, and T. Herawan, "A systematic review on educational data mining," Ieee Access, vol. 5, pp. 15991-16005, 2017. https://doi.org/10.1109/ACCESS.2017.2654247.
    33. V. Švábenský, J. Vykopal, P. Čeleda, and L. Kraus, "Applications of educational data mining and learning analytics on data from cybersecurity training," Education and Information Technologies, vol. 27, no. 9, pp. 12179-12212, 2022. https://doi.org/10.1007/s10639-022-11093-6.
    34. P. Ryan et al., "Health Outcomes and Medical Effectiveness Research (HOMER): A Systematic Approach to Exploring Hill's Causal Viewpoints in Observational Data," in PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2014, vol. 23: WILEY-BLACKWELL 111 RIVER ST, HO-BOKEN 07030-5774, NJ USA, pp. 145-146.
    35. E. Shirzad, G. Ataei, and H. Saadatfar, "Applications of data mining in healthcare area: A survey," Engineering & Applied Science Research, vol. 48, no. 3, 2021.
    36. A. Alam and A. Mohanty, "Predicting students’ performance employing educational data mining techniques, machine learning, and learning analyt-ics," in International Conference on Communication, Networks and Computing, 2022: Springer, pp. 166-177. https://doi.org/10.1007/978-3-031-43140-1_15.
    37. S. Usman, R. Mehmood, I. Katib, and A. Albeshri, "Data locality in high performance computing, big data, and converged systems: An analysis of the cutting edge and a future system architecture," Electronics, vol. 12, no. 1, p. 53, 2022. https://doi.org/10.3390/electronics12010053.
    38. M. Bansal, A. Goyal, and A. Choudhary, "A comparative analysis of K-nearest neighbor, genetic, support vector machine, decision tree, and long short term memory algorithms in machine learning," Decision Analytics Journal, vol. 3, p. 100071, 2022. https://doi.org/10.1016/j.dajour.2022.100071.
    39. Z. Zhang and Z. Zhang, "Artificial neural network," Multivariate time series analysis in climate and environmental research, pp. 1-35, 2018. https://doi.org/10.1007/978-3-319-67340-0_1.
    40. K. Taunk, S. De, S. Verma, and A. Swetapadma, "A brief review of nearest neighbor algorithm for learning and classification," in 2019 internation-al conference on intelligent computing and control systems (ICCS), 2019: IEEE, pp. 1255-1260. https://doi.org/10.1109/ICCS45141.2019.9065747.
    41. M. Shouman, T. Turner, and R. Stocker, "Disease Patients," International Journal of Information and Education Technology, vol. 2, no. 3, 2012.
    42. B. Milovic and M. Milovic, "Prediction and decision making in health care using data mining," Arabian Journal of Business and Management Re-view (Kuwait Chapter), vol. 1, no. 12, pp. 126-136, 2012. https://doi.org/10.11591/ijphs.v1i2.1380.
    43. A. H. Abdulwahhab, A. H. Abdulaal, A. H. T. Al-Ghrairi, A. A. Mohammed, and M. Valizadeh, "Detection of epileptic seizure using EEG signals analysis based on deep learning techniques," Chaos, Solitons & Fractals, vol. 181, p. 114700, 2024. https://doi.org/10.1016/j.chaos.2024.114700.
    44. A. Sheikh et al., "Health information technology and digital innovation for national learning health and care systems," The Lancet Digital Health, vol. 3, no. 6, pp. e383-e396, 2021. https://doi.org/10.1016/S2589-7500(21)00005-4
    45. C. Sirocchi, A. Bogliolo, and S. Montagna, "Medical-informed machine learning: integrating prior knowledge into medical decision systems," BMC Medical Informatics and Decision Making, vol. 24, no. Suppl 4, p. 186, 2024. https://doi.org/10.1186/s12911-024-02582-4
    46. D. A. Neu, J. Lahann, and P. Fettke, "A systematic literature review on state-of-the-art deep learning methods for process prediction," Artificial Intelligence Review, vol. 55, no. 2, pp. 801-827, 2022. https://doi.org/10.1007/s10462-021-09960-8.
    47. H. Ameri, S. Alizadeh, and E. A. Z. Noughabi, "Application of data mining techniques in clinical decision making: A literature review and classifi-cation," Handbook of Research on Data Science for Effective Healthcare Practice and Administration, pp. 257-295, 2017. https://doi.org/10.4018/978-1-5225-2515-8.ch012.
    48. D. McGilvray, Executing data quality projects: Ten steps to quality data and trusted information (TM). Academic Press, 2021.
    49. T. Shaik et al., "Remote patient monitoring using artificial intelligence: Current state, applications, and challenges," Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol. 13, no. 2, p. e1485, 2023. https://doi.org/10.1002/widm.1485.
    50. T. Sarwar et al., "The secondary use of electronic health records for data mining: Data characteristics and challenges," ACM Computing Surveys (CSUR), vol. 55, no. 2, pp. 1-40, 2022. https://doi.org/10.1145/3490234.
    51. A. Ed-daoudy and K. Maalmi, "Breast cancer classification with reduced feature set using association rules and support vector machine," Network Modeling Analysis in Health Informatics and Bioinformatics, vol. 9, no. 1, p. 34, 2020. https://doi.org/10.1007/s13721-020-00237-8.
    52. S. Zhang, C. Zhang, and Q. Yang, "Data preparation for data mining," Applied artificial intelligence, vol. 17, no. 5-6, pp. 375-381, 2003. https://doi.org/10.1080/713827180.
    53. Z. Wu and V. Trigo, "Impact of information system integration on the healthcare management and medical services," International Journal of Healthcare Management, vol. 14, no. 4, pp. 1348-1356, 2021. https://doi.org/10.1080/20479700.2020.1760015.
    54. W.-T. Wu et al., "Data mining in clinical big data: the frequently used databases, steps, and methodological models," Military Medical Research, vol. 8, pp. 1-12, 2021. https://doi.org/10.1186/s40779-021-00338-z.
    55. G. Shmueli, P. C. Bruce, K. R. Deokar, and N. R. Patel, Machine learning for business analytics: Concepts, techniques, and applications with ana-lytic solver data mining. John Wiley & Sons, 2023.
    56. B. Al-Sahab, A. Leviton, T. Loddenkemper, N. Paneth, and B. Zhang, "Biases in Electronic Health Records Data for Generating Real-World Evi-dence: An Overview," Journal of Healthcare Informatics Research, vol. 8, no. 1, pp. 121-139, 2024. https://doi.org/10.1007/s41666-023-00153-2.
    57. T. Ramesh, U. K. Lilhore, M. Poongodi, S. Simaiya, A. Kaur, and M. Hamdi, "Predictive analysis of heart diseases with machine learning ap-proaches," Malaysian Journal of Computer Science, pp. 132-148, 2022. https://doi.org/10.22452/mjcs.sp2022no1.10.
    58. A. Ampavathi, "Research challenges and future directions towards medical data processing," Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization, vol. 10, no. 6, pp. 633-652, 2022. https://doi.org/10.1080/21681163.2021.2018665.
    59. A. Gupta, R. Singh, V. K. Nassa, R. Bansal, P. Sharma, and K. Koti, "Investigating application and challenges of big data analytics with cluster-ing," in 2021 international conference on advancements in electrical, electronics, communication, computing and automation (ICAECA), 2021: IEEE, pp. 1-6. https://doi.org/10.1109/ICAECA52838.2021.9675483.
    60. M. M. Rahman, M. S. Chowdhury, M. Shorfuzzaman, L. Karim, M. Shafiullah, and F. Azzedin, "Enhancing Septic Shock Detection through Inter-pretable Machine Learning," CMES-Computer Modeling in Engineering & Sciences, vol. 141, no. 3, 2024. https://doi.org/10.32604/cmes.2024.055065.
    61. P. Kaushik, K. Sharma, M. K. Mahawar, J. Wasim, G. Dey, and S. A. Nibiya, "Ethical Considerations in Data Mining and Analytics," in 2024 1st International Conference on Advances in Computing, Communication and Networking (ICAC2N), 2024: IEEE, pp. 1516-1521. https://doi.org/10.1109/ICAC2N63387.2024.10895012.
    62. T. Tsalko, S. Nevmerzhytska, S. Krasniuk, S. Goncharenko, and N. Liubymova, "Features, problems and prospects of data mining and data science application in educational management," Bulletin of Science and Education (Series" Philology", Series" Pedagogy", Series" Sociology", Series" Culture and Art", Series" History and Archeology"), 2024. https://doi.org/10.52058/2786-6165-2024-5(23)-637-657.
    63. E. I. Kabanov, M. V. Tumanov, V. S. Smetanin, and K. V. Romanov, "An innovative approach to injury prevention in mining companies through human factor management," Записки Горного института, no. 263 (eng), pp. 774-784, 2023.
    64. P. Solana-González, A. A. Vanti, M. M. García Lorenzo, and R. E. Bello Pérez, "Data mining to assess organizational transparency across technology processes: An approach from it governance and knowledge management," Sustainability, vol. 13, no. 18, p. 10130, 2021. https://doi.org/10.3390/su131810130
    65. Y. Zhong, L. Chen, C. Dan, and A. Rezaeipanah, "A systematic survey of data mining and big data analysis in internet of things," The Journal of Supercomputing, vol. 78, no. 17, pp. 18405-18453, 2022. https://doi.org/10.1007/s11227-022-04594-1.
    66. V. S. Naresh and M. Thamarai, "Privacy‐preserving data mining and machine learning in healthcare: Applications, challenges, and solutions," Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol. 13, no. 2, p. e1490, 2023. https://doi.org/10.1002/widm.1490.
    67. A. Nambiar and D. Mundra, "An overview of data warehouse and data lake in modern enterprise data management," Big data and cognitive computing, vol. 6, no. 4, p. 132, 2022. https://doi.org/10.3390/bdcc6040132.
    68. A. O. Akindemowo et al., "A Conceptual Framework for Automating Data Pipelines Using ELT Tools in Cloud-Native Environments," 2021. https://doi.org/10.54660/.JFMR.2021.2.1.440-452.
    69. A. Jagadish, Essential Concepts and Techniques of AI & ML. Academic Guru Publishing House, 2024.
    70. D. K. Pandiya and N. Charankar, "Integration of microservices and AI for real-time data processing," International journal of computer engineering and technology (IJCET), vol. 14, no. 2, pp. 240-254, 2023.
    71. G. S. Reddy, "A review of data warehouses multidimensional model and data mining," Information Technology in Industry, vol. 9, no. 3, pp. 310-320, 2021.
    72. A. H. T. Al-Ghrairi, A. A. Mohammed, and H. M. Saeed, "An Application of Web-based E-Healthcare Management System Using ASP. Net," Webology, vol. 18, no. 1, 2021. https://doi.org/10.14704/WEB/V18I1/WEB18089.
    73. H. A. Ganatra, "Machine learning in pediatric healthcare: Current trends, challenges, and future directions," Journal of Clinical Medicine, vol. 14, no. 3, p. 807, 2025. https://doi.org/10.3390/jcm14030807.
    74. B. Zhou, G. Yang, Z. Shi, and S. Ma, "Natural language processing for smart healthcare," IEEE Reviews in Biomedical Engineering, vol. 17, pp. 4-18, 2022. https://doi.org/10.1109/RBME.2022.3210270.
  • Downloads

  • How to Cite

    Mohammed , H. J. . (2026). Software Concept and Knowledge Extraction for Data Mining In ‎Healthcare Sectors. International Journal of Basic and Applied Sciences, 15(1), 177-192. https://doi.org/10.14419/472vp602