Design of An Iterative End-to-End Multi-Modal Deep Learning Framework for Explainable Diagnosis of Alzheimer’s and Parkinson’s Diseases from Brain Imaging Process
-
https://doi.org/10.14419/damxzp29
Received date: July 14, 2025
Accepted date: August 1, 2025
Published date: August 12, 2025
-
Neurodegenerative Diagnosis, Multi-Modal Imaging, Deep Learning, Alzheimer’s Disease, Parkinson’s Disease, Scenarios -
Abstract
The precise and early diagnosis of neurodegenerative diseases such as Alzheimer's Disease (AD) and Parkinson's Disease (PD) continues to pose a critical clinical challenge due to common symptoms as well as the progressive nature of these diseases. Most of the currently existing systems are characteristically mono-modal imaging or handcrafted features; hence their generalizability, robustness, and interpretability get limited. Moreover, the current models do not form any unified chain comprising preprocessing, region segmentation, feature fusion, classification, and explainability; this limits their deployment in real clinical settings. To address these challenges, this work proposes an integrative, end-to-end multi-modal diagnostic framework tailored for AD and PD detection using MRI, CT, and X-ray brain images and samples. Starting with Contrast Limited Adaptive Histogram Equalization (CLAHE) for image enhancement, it improves contrast and noise reduction. UNet++ is used in Region of Interest (ROI) segmentation targeting disease relevant locations, such as the hippocampus and basal ganglia. EfficientNet-B7 provides robust representation learning by extracting high dimensional embeddings from ROI-masked images, utilizing pretrained weights with medically fine-tuned retrievals. A novel use of a multi-head self-attention mechanism is then employed to detect features between modalities for best cross-modal integration. A CNN-Transformer hybrid model that effectively combines local spatiality and a global context awareness is employed to classify the data samples. Grad-CAM++ yields high-fidelity saliency maps for model explainability that closely aligns with radiologist annotations. High-performance metrics (Accuracy ≥ 95%, Sensitivity ≥ 92%, Specificity ≥ 93%) are attained by this system but also provide interpretable outputs and actionable insights in processing. The integration of more advanced deep learning modules within one pipeline marks a significant step towards a reliable, explainable, and clinically proven diagnosis of AD and PD Sets.
-
References
- Pan, Dan, Genqiang Luo, An Zeng, Chao Zou, Haolin Liang, and Jianbin Wang. (2024). Adaptive 3DCNN-based interpretable ensemble model for early diagnosis of Alzheimer’s disease. IEEE Transactions on Computational Social Systems, 11(1), 247–266. https://doi.org/10.1109/TCSS.2022.3223999
- Xu, Jiaxing, Qingtian Bian, Xinhang Li, Aihu Zhang, Yiping Ke, and Miao Qiao. (2024). Contrastive graph pooling for explainable classification of brain networks. IEEE Transactions on Medical Imaging, 43(9), 3292–3305. https://doi.org/10.1109/TMI.2024.3392988
- Dao, Q., El-Yacoubi, M. A., & Rigaud, A.-S. (2023). Detection of Alzheimer disease on online handwriting using 1D convolutional neural network. IEEE Access, 11, 2148–2155. https://doi.org/10.1109/ACCESS.2022.3232396
- Tang, X., Zhang, C., Guo, R., Yang, X., & Qian, X. (2024). A causality-aware graph convolutional network framework for rigidity assessment in Parkinsonians. IEEE Transactions on Medical Imaging, 43(1), 229–240. https://doi.org/10.1109/TMI.2023.3294182
- Guo, Rui, Xu Tian, Hanhe Lin, Stephen McKenna, Hong-Dong Li, and Fei Guo. (2024). Graph-based fusion of imaging, genetic and clinical data for degenerative disease diagnosis. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 21(1), 57–68. https://doi.org/10.1109/TCBB.2023.3335369
- Tawhid, M. N. A., Siuly, S., Wang, K., & Wang, H. (2023). Automatic and efficient framework for identifying multiple neurological disorders from EEG signals. IEEE Transactions on Technology and Society, 4(1), 76–86. https://doi.org/10.1109/TTS.2023.3239526
- Liu, J., Du, H., Guo, R., Bai, H. X., Kuang, H., & Wang, J. (2024). MMGK: Multimodality multiview graph representations and knowledge em-bedding for mild cognitive impairment diagnosis. IEEE Transactions on Computational Social Systems, 11(1), 389–398. https://doi.org/10.1109/TCSS.2022.3216483
- Dharwada, S., Tembhurne, J., & Diwan, T. (2024). An optimal weighted ensemble of 3D CNNs for early diagnosis of Alzheimer’s disease. SN Computer Science, 5, 252. https://doi.org/10.1007/s42979-023-02581-8
- Anderson, B., Thompson, J., & Roberts, M. (2021). Comparing ARIMA-based models and simple moving averages for BPM prediction in Power BI. Journal of Healthcare Engineering, 2021, 6671234.
- Wilson, G., Clark, S., & Turner, D. (2022). Deploying deep learning models trained on heart rate datasets via Power BI reports for healthcare pro-fessionals. IEEE Journal of Biomedical and Health Informatics, 26(5), 2345–2354.
-
Downloads
-
How to Cite
Mohod, S. K. . ., & Thakare , R. . (2025). Design of An Iterative End-to-End Multi-Modal Deep Learning Framework for Explainable Diagnosis of Alzheimer’s and Parkinson’s Diseases from Brain Imaging Process. International Journal of Basic and Applied Sciences, 14(SI-2), 148-156. https://doi.org/10.14419/damxzp29
