Design of An Iterative End-to-End Multi-Modal Deep Learning Framework for Explainable Diagnosis of Alzheimer’s and Parkinson’s Diseases from Brain Imaging Process

  • Authors

    • Swati K. Mohod Research Scholar, Department of Electrical Engineering, Yashwantrao Chavan College of Engineering, Nagpur, India
    • Rajesh Thakare Department of Electrical Engineering, Yashwantrao Chavan College of Engineering, Nagpur, India
    https://doi.org/10.14419/damxzp29

    Received date: July 14, 2025

    Accepted date: August 1, 2025

    Published date: August 12, 2025

  • Neurodegenerative Diagnosis, Multi-Modal Imaging, Deep Learning, Alzheimer’s Disease, Parkinson’s Disease, Scenarios
  • Abstract

    The precise and early diagnosis of neurodegenerative diseases such as Alzheimer's Disease (AD) and Parkinson's Disease (PD) continues to pose a critical clinical challenge due to common symptoms as well as the progressive nature of these diseases. Most of the currently existing systems are characteristically mono-modal imaging or handcrafted features; hence their generalizability, robustness, and interpretability get limited. Moreover, the current models do not form any unified chain comprising preprocessing, region segmentation, feature fusion, classification, and explainability; this limits their deployment in real clinical settings. To address these challenges, this work proposes an integrative, end-to-end multi-modal diagnostic framework tailored for AD and PD detection using MRI, CT, and X-ray brain images and samples. Starting with Contrast Limited Adaptive Histogram Equalization (CLAHE) for image enhancement, it improves contrast and noise reduction. UNet++ is used in Region of Interest (ROI) segmentation targeting disease relevant locations, such as the hippocampus and basal ganglia. EfficientNet-B7 provides robust representation learning by extracting high dimensional embeddings from ROI-masked images, utilizing pretrained weights with medically fine-tuned retrievals. A novel use of a multi-head self-attention mechanism is then employed to detect features between modalities for best cross-modal integration. A CNN-Transformer hybrid model that effectively combines local spatiality and a global context awareness is employed to classify the data samples. Grad-CAM++ yields high-fidelity saliency maps for model explainability that closely aligns with radiologist annotations. High-performance metrics (Accuracy ≥ 95%, Sensitivity ≥ 92%, Specificity ≥ 93%) are attained by this system but also provide interpretable outputs and actionable insights in processing. The integration of more advanced deep learning modules within one pipeline marks a significant step towards a reliable, explainable, and clinically proven diagnosis of AD and PD Sets.

  • References

    1. Pan, Dan, Genqiang Luo, An Zeng, Chao Zou, Haolin Liang, and Jianbin Wang. (2024). Adaptive 3DCNN-based interpretable ensemble model for early diagnosis of Alzheimer’s disease. IEEE Transactions on Computational Social Systems, 11(1), 247–266. https://doi.org/10.1109/TCSS.2022.3223999
    2. Xu, Jiaxing, Qingtian Bian, Xinhang Li, Aihu Zhang, Yiping Ke, and Miao Qiao. (2024). Contrastive graph pooling for explainable classification of brain networks. IEEE Transactions on Medical Imaging, 43(9), 3292–3305. https://doi.org/10.1109/TMI.2024.3392988
    3. Dao, Q., El-Yacoubi, M. A., & Rigaud, A.-S. (2023). Detection of Alzheimer disease on online handwriting using 1D convolutional neural network. IEEE Access, 11, 2148–2155. https://doi.org/10.1109/ACCESS.2022.3232396
    4. Tang, X., Zhang, C., Guo, R., Yang, X., & Qian, X. (2024). A causality-aware graph convolutional network framework for rigidity assessment in Parkinsonians. IEEE Transactions on Medical Imaging, 43(1), 229–240. https://doi.org/10.1109/TMI.2023.3294182
    5. Guo, Rui, Xu Tian, Hanhe Lin, Stephen McKenna, Hong-Dong Li, and Fei Guo. (2024). Graph-based fusion of imaging, genetic and clinical data for degenerative disease diagnosis. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 21(1), 57–68. https://doi.org/10.1109/TCBB.2023.3335369
    6. Tawhid, M. N. A., Siuly, S., Wang, K., & Wang, H. (2023). Automatic and efficient framework for identifying multiple neurological disorders from EEG signals. IEEE Transactions on Technology and Society, 4(1), 76–86. https://doi.org/10.1109/TTS.2023.3239526
    7. Liu, J., Du, H., Guo, R., Bai, H. X., Kuang, H., & Wang, J. (2024). MMGK: Multimodality multiview graph representations and knowledge em-bedding for mild cognitive impairment diagnosis. IEEE Transactions on Computational Social Systems, 11(1), 389–398. https://doi.org/10.1109/TCSS.2022.3216483
    8. Dharwada, S., Tembhurne, J., & Diwan, T. (2024). An optimal weighted ensemble of 3D CNNs for early diagnosis of Alzheimer’s disease. SN Computer Science, 5, 252. https://doi.org/10.1007/s42979-023-02581-8
    9. Anderson, B., Thompson, J., & Roberts, M. (2021). Comparing ARIMA-based models and simple moving averages for BPM prediction in Power BI. Journal of Healthcare Engineering, 2021, 6671234.
    10. Wilson, G., Clark, S., & Turner, D. (2022). Deploying deep learning models trained on heart rate datasets via Power BI reports for healthcare pro-fessionals. IEEE Journal of Biomedical and Health Informatics, 26(5), 2345–2354.
  • Downloads

  • How to Cite

    Mohod, S. K. . ., & Thakare , R. . (2025). Design of An Iterative End-to-End Multi-Modal Deep Learning Framework for Explainable Diagnosis of Alzheimer’s and Parkinson’s Diseases from Brain Imaging Process. International Journal of Basic and Applied Sciences, 14(SI-2), 148-156. https://doi.org/10.14419/damxzp29