Abstract
This study aimed to address the issue of larger prediction errors existing in intelligent predictive tasks related to Alzheimer’s disease (AD). A cohort of 487 enrolled participants was categorized into three groups: normal control (138 individuals), mild cognitive impairment (238 patients), and AD (111 patients) in this study. An improved multifeature squeeze-and-excitation-dilated residual network (MFSE-DRN) was proposed for two important AD predictions: clinical scores and conversion probability. The model was characterized as three modules: squeeze-and-excitation-dilated residual block (SE-DRB), multifusion pooling (MF-Pool), and multimodal feature fusion. To assess its performance, the proposed model was compared with two other novel models: ranking convolutional neural network (RCNN) and 3D vision geometrical group network (3D-VGGNet). Our method showed the best performance in the two AD predicted tasks. For the clinical scores prediction, the root-mean-square errors (RMSEs) and mean absolute errors (MAEs) of mini-mental state examination (MMSE) and AD assessment scale–cognitive 11-item (ADAS-11) were 1.97, 1.46 and 4.20, 3.19 within 6 months; 2.48, 1.69 and 4.81, 3.44 within 12 months; 2.67, 1.86 and 5.81, 3.83 within 24 months; 3.02, 2.03 and 5.09, 3.43 within 36 months, respectively. At the AD conversion probability prediction, the prediction accuracies within 12, 24, and 36 months reached to 88.0, 85.5, and 88.4%, respectively. The AD predication would play a great role in clinical applications.
Similar content being viewed by others

Introduction
Alzheimer’s disease (AD) is a type of common neurodegenerative disease that generally accompanies with progressive impairments in human cognition, behavior, and analytical skills1. AD has a prevalence of 4.4% in people older than 65 years, and its treatment cost has exceeded 300 billion dollars worldwide2,3. Early prediction of AD could effectively help to intervene the progression in time and improve the outcome of treatment4. Recently, the intelligent prediction of AD via deep learning (DL) algorithms has brought new opportunities for early diagnosis of AD5.
During the progression of AD, mild cognitive impairment (MCI) is regarded as the critical transition period from normal to AD. Patients with MCI have been reported to be 10 times more likely than normal individuals to progress to AD, and about 70% of MCI patients deteriorate to AD within 5 years. MCI is generally classified into two subtypes: progressive MCI (pMCI) and stable MCI (sMCI), based on the progression of the disease. Timely intervention in the MCI stage has shown potential in delaying the prognosis of the disease and significantly improve the prognosis of patients. Therefore, accurate prediction of AD conversion probability at the MCI stages has garnered significant interest among clinical researchers6.
In clinical practice, the diagnostic methods of AD mainly include neuropsychological testing, neuroimaging, genetic testing, and biomarker testing. A series of neuropsychological tests, such as mini-mental state examination (MMSE), AD assessment scale-cognitive 11-item (ADAS-11), and so forth, played a crucial role in AD diagnosis and provided an in-depth insight into the state of cognitive functioning7,8. The neuroimaging modalities of magnetic resonance imaging (MRI) and positron emission tomography (PET) have been widely used in evaluating AD. For accurate quantitative evaluation of the structural changes in the brain along with AD progress, structural magnetic resonance imaging (sMRI) has become the most popular imaging technique in AD evaluation9. Thus far, several susceptibility genes have been identified for AD include amyloid precursor protein, presenilin 1, presenilin 2, and apolipoprotein E (ApoE). Among these genes, ApoE is closely correlated with AD occurrence. In addition, the biomarker testing was applied to monitor the state of AD progression by measuring the concentration of specific antibodies and proteins, including Aβ1-41, Aβ42/Aβ40, total tau protein (t-tau), and hyperphosphorylated tau protein (p-tau) in the cerebrospinal fluid or blood10,11. Therefore, with the multimodality clinical data, the deduced features of sMRI, genes, biomarkers, and clinical scores have been used for AD predication12,13,14,15,16.
In the literature, traditional ML and DL algorithms have made great progress in the AD classification. Once, Arafa et al.17 used a traditional CNN model for the classification of mild-dementia and non-dementia with an accuracy of 99.99%. And Fathi et al.18 applied six CNN classifiers to form an integrated model with up to 93.92% accuracy. Then, Helaly et al.19 used the traditional CNN model, the classification accuracies for 2D and 3D data of AD were 93.61 and 95.17%, respectively, and VGG19 models for transfer learning with up to 97% accuracy. Notably, Shankar et al.20 developed a novel hierarchical residual attention learning-inspired multistage conjoined twin network (HRAL-CTNN) and got an classification accuracy of 99.97%.
Besides, two crucial AD prediction tasks have been identified: clinical scores and conversion probability. For AD intelligent prediction, alterations in clinical scores and conversion probability at different time periods have been investigated by the traditional machine learning (ML) and DL algorithms.
It was reported that traditional ML algorithms were proposed for predicting clinical scores. Zhou et al.16 proposed a convex fused sparse group least absolute shrinkage and selection operator (LASSO) model using MRI and genetic and demographic features to predict clinical scores, and eventually obtained the root-mean-square errors (RMSEs) of 2.737 and 5.678 for MMSE and ADAS within 12 months, respectively. Then, Lei et al.21 used MRI and clinical score features to construct a feature selection model of LASSO and correlation entropy to predict MMSE within 18 months, and achieved a mean absolute error (MAE) of 1.74. Again, Tabarestani et al.22 applied random forest from PET, MRI, and genetic and clinical score features for MMSE prediction within 24 months, with an RMSE of 3.15.
Notably, the DL models achieved robust performance in clinical scores prediction. Tabarestani et al.23 used PET, MRI, clinical scores, and genetic and fluid biomarker features to construct a long short-term memory model to predict MMSE. They obtained an RMSE of 1.97 within 12 months. Liu et al.24 proposed a weakly supervised densely connected neural network model using combined features of MRI and clinical scores, and eventually obtained the prediction RMSEs of 3.408 and 7.451 for MMSE and ADAS within 24 months, respectively. Additionally, Zhang et al.25 used a sparse linear regression model combined with MRI, PET and clinical scoring features to predict the RMSE of 2.035 within 24 months for MMSE. Furthermore, Liu et al.26 proposed a deep multitask multichannel learning (DM2L) model with MRI and demographic features to predict ADAS with a RMSE of 6.2 within 36 months.
For the AD conversion probability prediction, a series of traditional ML algorithms have been addressed. Tangaro et al.27 proposed a support vector machine (SVM) based fuzzy class algorithm model using multifeatures of the hippocampal changes and clinical scores, and obtained an accuracy (ACC) of 83.4% for AD conversion probability within 12 months. Moreover, Shu et al.28 used an integrated ML model to combine MRI, clinical scores and genetic features, and got a conversion probability within 12 months with an area under the curve (AUC) of 0.814. Lin et al.29 implemented an extreme learning machine with MRI, PET, and genetic and biological features to achieve an ACC of 83.8% for predicting AD conversion probability within 24 months. Furthermore, Ezzati et al.30 applied an ensemble linear discriminant model using MRI and demographic and genetic features to obtain an ACC of 74.9% within 24 months. Also, Gaser et al.31 used a relevance vector machine model using clinical scores, hippocampus, and biomarker features to predict AD conversion probability within 36 months and obtained an ACC of 81%.
Accordingly, many DL algorithms have been proposed for AD conversion probability prediction. Recently, Llano et al.32 used MRI and clinical scores to predict the AD conversion probability using a tree-based multivariate model and obtained an AUC of 0.665 within 12 months. More recently, Lee et al.33 applied a recurrent neural network model with MRI, demographics, fluid biomarkers, and genetic features for predicting AD conversion probability within 24 months and obtained an ACC of 75%. Spasov et al.34 used MRI, clinical scores, and genetic and demographic features to construct a 3D divisible convolutional neural network for AD conversion probability prediction and got an ACC of 86% within 36 months. Then, Lu et al.35 used deep neural networks using PET and sMRI features and achieved an ACC of 82.4% within 36 months for AD conversion probability prediction. Overall, the DL models have shown better capability in the two predicted tasks of clinical scores and conversion probability in AD. However, there are still challenges to overcome, as large prediction errors and low accuracy remain a concern.
To resolve these problems, the 3D multifeature squeeze-and-excitation-dilated residual network (MFSE-DRN) was proposed for predicting clinical scores and conversion probability in AD. The MFSE-DRN model incorporates three novel modules: squeeze-and-excitation-dilated residual block (SE-DRB), multifusion pooling (MF-Pool), and multimodal feature fusion. These modules enhance the model’s ability to extract multimodal features while maintaining stability to prevent overfitting. By implementing the MFSE-DRN model, we accomplished longitudinal predictions of clinical scores and conversion probability in AD using baseline data. This approach offers a fresh perspective for early intelligent diagnosis of AD, providing valuable insights for improved patient care and management.
Results
Clinical scores prediction
During the ablation experiments, the proposed model showed lower RMSE and MAE values than the control group for MMSE and ADAS-11 score prediction at 6, 12, 24, and 36 months (M06, M12, M24, and M36) (Table 1). As shown in Table 1, even after removing the added modules sequentially, the proposed model exhibited significantly lower RMSE and MAE values than the control group for MMSE and ADAS-11 score prediction at M06, M12, M24, and M36. Thus, the results of the ablation experiment validated the effectiveness of each added module in AD prediction.
Table 2 shows that the proposed model demonstrated best performance in the two tasks of predicting MMSE and ADAS-11 clinical scores and conversion probability within M06, M12, M24, and M36. In MMSE prediction, the proposed model had the lowest RMSEs of 1.97, 2.48, 2.67, and 3.02 and MAEs of 1.46, 1.69, 1.86, and 2.03 within M06, M12, M24, and M36, respectively. For ADAS-11 prediction, the proposed model had the lowest RMSEs of 4.20, 4.81, 5.81, and 5.09 and MAEs of 3.19, 3.44, 3.83, and 3.43 within M06, M12, M24, and M36, respectively.
AD conversion probability
According to the ablation experiments, the proposed model obtained the highest predicted ACCs compared to ablation controls within M12, M24, and M36, respectively, as shown in Table 3. As depicted in Table 4, the proposed MFSE-DRN model exhibited superior capability in the AD conversion probability prediction with the highest ACCs of 88.0, 85.5, and 88.4% within M12, M24, and M36. As shown in Fig. 1, the AUCs of the proposed model reached to 0.74, 0.90, and 0.94 for AD conversion probability within M12, M24, and M36, respectively.
Discussion
In clinical practice, accurate predictions of clinical scores and conversion probability in AD play a crucial role in understanding the rate of disease progression and tailoring individualized treatment plans. The longitudinal prediction of AD demonstrated significant values in evaluating the progression of AD and preventing its onset. In this study, the proposed model exhibited robust performance in AD predictions of clinical scores and conversion probability. These findings highlight the potential of the model for aiding clinicians in making informed decisions and improving patient outcomes. This is expected to be applied to clinical research.
The combined features of sMRI, genetics, clinical scores, and biomarkers were confirmed to have high sensitivity and specificity for AD. The results showed that the fusion of multiple features ensured that the vital features were acquired during feature extraction and improved the model robustness. It was consistent with the previous findings that various types of data enhanced the model’s ability to learn more complementary information during training36,37,38. Therefore, this study utilized multiple types of baseline data to betterment the effectiveness of the model. Due to the absence of the follow-up data for the CDRSB and RAVLT immediate scores of the participants in the data set, it was worthy of noting that our study did not predict these two clinical scores.
Different from the existing AD prediction models, our proposed model did have substantial innovations in three aspects. Firstly, the construction of the DRB could extract internal core characteristics of sMRI facilitating efficient model39. Secondly, the incorporation of the SE structure enhanced the DRB’s ability to accurately capture feature correlations and maintain model stability when handing high-dimensional data40. Thirdly, the inclusion of both max pooling and average pooling layers in the multifusion pooling (MF-Pool) block enhanced the model’s capability to extract and delineate nonlinear features while also reducing dimensionality. This enabled the neural network to effectively extract high-dimensional features. As shown in previous studies, the use of multimodal features provides abundant feature information to precisely reflect the pathological process of AD and thus enhances the performance of the model to accurately predict AD41,42,43.
Our experimental results displayed that the prediction RMSEs and MAEs of clinical scores gradually increased along with the follow-up time points. This means that the prediction of clinical scores with typical brain morphology is more accurate in the early stages. This is because the hippocampal and internal olfactory cortical brain regions show a marked tendency to atrophy in the early stages of AD16. Similarly, it was validated that those cortices were highly correlated with the MMSE and ADAS in our experiment. Subsequently, the prediction error increased along with the trend of slow cognitive decline in the later stages of AD24,44. According to the AD conversion probability, there existed obvious trends of increased ACCs and AUCs along with the varied predicted time points. It was due to the fact that the significant changes of brain regions produced at the later AD stages would help to improve the predication accuracy45.
This study still had some limitations. Firstly, the enrolled samples from the public data set were very limited, this increased the likelihood of individual influence during data collection and might have even led to overfitting during the predicted tasks. Then, since no other available public data sets were provided, the validation of our model would resort to private and multi-center data sets. Finally, only one type of imaging modality of sMRI was used in our study, and the application of multiple imaging modalities of MR and/or PET is still worthy of trying and evaluating.
In future, more cases with multi-modal data should be involved to improve the performance of predictions. In addition, more kinds of DL strategies such as transfer learning and reinforcement learning should be integrated to improve the accuracy, and reliability of the models for AD prediction46,47. Especially, the interpretability for proposed model would be investigated to improve the visualization and analysis of the features for the hidden layers.
Materials and methods
Materials
Enrolled participants
A total of 487 participants was enrolled from the Alzheimer’s disease neuroimaging initiative (ADNI) database (www.adni-info.org), including 111 patients with AD, 238 patients with MCI, and 138 normal controls. The subjects are primarily non-Hispanic white subjects. All enrolled participants met with the criteria of the national institute of neurological and communicative disorders and stroke/AD and related disorders association.
Data description
The data sets of sMRI, clinical scores (MMSE, ADAS-11, clinical dementia rating scale sum of boxes (CDRSB), and rey auditory verbal learning test (RAVLT) immediate, genetics (ApoE4), and biomarker features (Aβ42/Aβ40 antibodies, t-tau, and p-tau) at baseline were collected for each participant. And the demographic information of the enrolled participants is presented in Table 5. The chi-square tests validated that the variables such as sex, age, education years, MMSE, ADAS-11, CDRSB, and RAVLT immediate were individually independent and conformed to normal distribution among the three groups. Meanwhile, the clinical scores of MMSE and ADAS-11 at longitudinal time points of M06, M12, M24, and M36 were also collected for model evaluation, as shown in Table 6. The conversion cases of MCI groups were divided into two subgroups, pMCI and sMCI, as depicted in Table 7.
All participants underwent MR scanning of the brain, which were acquired using 3 T scanners from Siemens, general electric (GE), and Philips at multiple sites. T1-weighted images were acquired, and the protocol parameters were listed as follows: 1.2 mm slice thickness; 256 × 256 scanning matrix; repetition time = 2300 ms; echo time = 2.98 ms; field of view = 240 × 240 mm2; flip angle = 90 degree; and 256 × 256 reconstruction matrix.
Methods
The pipeline of the multifeature AD prediction included three steps: data preprocessing, model construction, and experimental setup (Fig. 2). Firstly, sMRI, genetic features, biological features and clinical information were individually preprocessed; secondly, the MFSE-DRN using the blocks of SE-DRB, MF-Pool and multi-feature fusion was constructed; and finally, the clinical scores and AD conversion probabilities were predicted.
Data preprocessing
The data prepossessing contained two aspects: one for the sMRI data set, and another for clinical scores, genetic and biomarker data sets. The sMRI data preprocessing consisted of three steps: image registration, non-brain tissue segmentation, and image enhancement. It was performed using the computational anatomy toolbox 12 (CAT12) software package (https://www.nitrc.org/projects/cat/). Firstly, the sMRI images were registered using the diffeomorphic anatomical registration through exponentiated lie algebra (DARTEL) method to ensure brain structure alignment across individuals. Secondly, non-brain tissues, such as the skull and scalp, were removed using morphology-based methods. Thirdly, the image enhancement was achieved through histogram equalization, spatial filtering, and nonlinear transformation methods. Besides, the raw data sets of clinical scores, genes, and biomarkers were normalized due to the varied magnitude scales.
Model construction of MFSE-DRN
Based on the classical residual network (DRN), our modified model integrated three key modules of SE-DRB, MF-Pool, and multifeature fusion. As illustrated in Fig. 3, our model comprised of 36 layers, including 32 convolutional (Conv) layers (16 SE-DRBs), one 7 × 7 × 7 Conv layer, 2 MF-Pool layers, and one multifeature fusion layer. The data pipeline of MFSE-DRN is represented as pseudo-code, as shown in Table 8.
The architecture of MFSE-DRN. The solid lines indicate unchanged number of input and output channels, while the dashed lines denote changed number of input and output channels for short connections. MFSE-DRN: multifeature squeeze-and-excitation-dilated residual network, MF-Pool: multifusion pooling, FC: fully connected, SE-DRB: squeeze-and-excitation-dilated residual block.
The SE-DRB was assembled by dilated convolution (DC) and squeeze-and-excitation (SE) blocks and enlarged the receptive convolution field for feature extraction. Then, the MF-Pool combined the max pooling and average pooling layers for extracting varying dimensions features. Furthermore, the multifeature fusion could effectively handle high-dimensional data. It concatenated concurrent features different levels of multiple features to extract the most representative features, and utilized the correlation and complementarity between different features to improve the accuracy of prediction.
The pipeline of our model was elucidated as follows: Firstly, the sMRI data was downsampled using a 7 × 7 × 7 Conv layer and a 3 × 3 × 3 MF-Pool layer. Next, the high-dimensional image features were extracted by the 16 SE-DRBs and outputted to the MF-Pool layer. At last, the image features were integrated with genetic features, clinical scores, and biological features, and the multi-features were connected with the fully connected (FC) layer. Since the proposed MF-Pool combined the max pooling and average pooling layers, and the capability of feature extraction at varied dimensions was effectively enhanced48. The final output of the MF-Pool is formulated as follows:
where Y is the final output of the MF-Pool; Y1 is the output of the maximum pooling layer; Y2 is the output of the average pooling layer; n is the number of feature maps; c is the number of channels; h is the number of rows; ω is the number of columns; s is the step size; kh is the pooling window length; kω is the pooling window width.
With the improved DRB, it facilitated the feature extraction capability for additional features by enlarging the Conv receptive field in residual networks. In our model, the DRB was constructed by replacing the first layer of the base convolution in the base residual module. Here, the base convolution kernel k = 3, DC expansion rate d = 2, equivalent convolution kernel k’ = 5, and current sensory field RFi+1 = 7.
Additionally, the use of residual structure in our network architecture can allow for deeper and more complex models to be trained, and the short-circuit connection can effectively avoid the overfitting problem caused by the model depth49. In order to improve the model stability, the SE-DRB was built by integrating the SE with the DRB, as illustrated in Fig. 4. This made a more precise capture of feature correlations by assigning adaptive weights to features across various channels, thus significantly elevating the network performance.
In Fig. 5, the multimodal feature fusion block combined the image features extracted from the Conv layer with the genes, clinical scores, and biometric features, and acted on the FC layer to effectively extract multifeature information to achieve longitudinal prediction of AD.
Experimental setup
In this study, two AD predicted tasks of clinical scores and conversion probability were performed. The first was to predict the MMSE and ADAS-11 scores at M06, M12, M24, and M36, and the second task was to predict the conversion probability of patients with MCI progressed to AD at M12, M24, and M36, respectively.
For performance evaluation, our proposed model was quantitatively compared with two other novel models: ranking convolutional neural network (RCNN) and 3D vision geometrical group network (3D-VGGNet)50,51. The ablation experiments were conducted by individually removing the three mentioned modules in the model, and five-fold cross-validation method with repeated random sampling is used.
During the experiments, The main hyperparameters were as follows: (1) during the clinical scores prediction, the number of training rounds was 150, the batch size was 8, the learning rate was 0.0001, the loss function was mean square error (MSE), and the optimizer was chosen as Adam; and (2) at the AD conversion probability prediction, the number of training rounds was 100, the batch size was 4, the learning rate was 0.0001, the loss function was Cross Entropy, and the Adam was chosen as optimizer. The regularization terms of L1 and L2 were implemented. Here, L1 produces a sparse matrix of weights and L2 allows the weights to decrease uniformly. This would ensure the model stability and avoid model overfitting.
Moreover, two metrics of MAE and RMSE were chosen for evaluating clinical scores prediction52, and five indices of ACC, precision, recall, F1, receiver operating characteristic, and AUC53 were defined to assess AD conversion probability prediction54.
The DL experiments were performed on a Windows 10 Pro workstation using an NVIDIA RTX 6000 graphics card. The CUDA version 11.3 and Python version 3.8 software were used with the DL framework of PyTorch.
Ethical approval
The present study was approved by The Shanghai University of Medicine and Health Sciences ethics review committee (approval number, 2019-GZR-06-142132197606243519). All research methods were conducted in strict accordance with relevant guidlines and regulations. We hereby confirm that informed consent was obtained from all subjects and/or their legal guardians who provided data.
Data availability
The datasets generated or analyzed during the study are available in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) repository, https://adni.loni.usc.edu/.
References
Sharma, R., Goel, T., Tanveer, M., Dwivedi, S. & Murugan, R. FAF-DRVFL: Fuzzy activation function based deep random vector functional links network for early diagnosis of Alzheimer disease. Appl. Soft Comput. 106, 107371. https://doi.org/10.1016/j.asoc.2021.107371 (2021).
Hao, N. et al. Acoustofluidic multimodal diagnostic system for Alzheimer’s disease. Biosens. Bioelectron. 196, 113730. https://doi.org/10.1016/j.bios.2021.113730 (2022).
Niu, H., Álvarez-Álvarez, I., Guillén-Grima, F. & Aguinaga-Ontoso, I. Prevalence and incidence of Alzheimer’s disease in Europe: A meta-analysis. Neurología (English Edition) 32, 523–532. https://doi.org/10.1016/j.nrleng.2016.02.009 (2017).
Liss, J. et al. Practical recommendations for timely, accurate diagnosis of symptomatic Alzheimer’s disease (MCI and Dementia) in primary care: A review and synthesis. J. Intern. Med. https://doi.org/10.1111/joim.13244 (2021).
Frederiksen, K. S., Gjerum, L., Waldemar, G. & Hasselbalch, S. G. Effects of physical exercise on Alzheimer’s disease biomarkers: A systematic review of intervention studies. J Alzheimers Dis. 61, 359–372. https://doi.org/10.3233/jad-170567 (2018).
Shen, T. et al. Decision supporting model for one-year conversion probability from MCI to AD using CNN and SVM. in 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). 738–741, https://doi.org/10.1109/EMBC.2018.8512398 (2018).
Dos Santos Picanco, L. C. et al. Alzheimer’s disease: A review from the pathophysiology to diagnosis, new perspectives for pharmacological treatment. Curr. Med. Chem. 25(26), 3141–3159. https://doi.org/10.2174/0929867323666161213101126 (2018).
Huang, L., Jin, Y., Gao, Y., Thung, K.-H. & Shen, D. Longitudinal clinical score prediction in Alzheimer’s disease with soft-split sparse regression based random forest. Neurobiol. Aging 46, 180–191. https://doi.org/10.1016/j.neurobiolaging.2016.07.005 (2016).
Simon, M. J. & Iliff, J. J. Regulation of cerebrospinal fluid (CSF) flow in neurodegenerative, neurovascular and neuroinflammatory disease. Biochimica et Biophysica Acta Mol. Basis Dis. 1862, 442–451. https://doi.org/10.1016/j.bbadis.2015.10.014 (2016).
Bell, R. D. et al. Apolipoprotein E controls cerebrovascular integrity via cyclophilin A. Nature 485, 512–516. https://doi.org/10.1038/nature11087 (2012).
Uddin, M. S. et al. APOE and Alzheimer’s disease: Evidence mounts that targeting APOE4 may combat Alzheimer’s pathogenesis. Mol. Neurobiol. 56, 2450–2465. https://doi.org/10.1007/s12035-018-1237-z (2019).
Pozueta, A. et al. Detection of early Alzheimer’s disease in MCI patients by the combination of MMSE and an episodic memory test. BMC Neurol. 11, 78. https://doi.org/10.1186/1471-2377-11-78 (2011).
Zhang, X. et al. Metrological properties of neuropsychological tests for measuring cognitive change in individuals with prodromal Alzheimer’s disease. Aging Mental Health 26, 1988–1996. https://doi.org/10.1080/13607863.2021.1966746 (2022).
Grundman, M. et al. Mild cognitive impairment can be distinguished from Alzheimer disease and normal aging for clinical trials. Arch. Neurol. 61, 59–66. https://doi.org/10.1001/archneur.61.1.59 (2004).
Ito, K., Hutmacher, M. M. & Corrigan, B. W. Modeling of Functional Assessment Questionnaire (FAQ) as continuous bounded data from the ADNI database. J. Pharmacokinet. Pharmacodyn. 39, 601–618. https://doi.org/10.1007/s10928-012-9271-3 (2012).
Zhou, J., Liu, J., Narayan, V. A. & Ye, J. Modeling disease progression via multi-task learning. NeuroImage 78, 233–248. https://doi.org/10.1016/j.neuroimage.2013.03.073 (2013).
Arafa, D. A. et al. A deep learning framework for early diagnosis of Alzheimer’s disease on MRI images. Multimed. Tools Appl. 83, 3767–3799. https://doi.org/10.1007/s11042-023-15738-7 (2024).
Fathi, S. et al. A deep learning-based ensemble method for early diagnosis of Alzheimer’s disease using MRI images. Neuroinformatics 22(1), 89–105. https://doi.org/10.1007/s12021-023-09646-2 (2024).
Helaly, H. A., Badawy, M. & Haikal, A. Y. Deep learning approach for early detection of Alzheimer’s disease. Cogn. Comput. 14(5), 1711–1727. https://doi.org/10.1007/s12559-021-09946-2 (2022).
Shankar, V. G., Sisodia, D. S. & Chandrakar, P. An intelligent hierarchical residual attention learning-based conjoined twin neural network for Alzheimer’s stage detection and prediction. Comput. Intell. 39, 783–805. https://doi.org/10.1111/coin.12594 (2023).
Lei, B. et al. Predicting clinical scores for Alzheimer’s disease based on joint and deep learning. Expert Syst. Appl. 187, 115966. https://doi.org/10.1016/j.eswa.2021.115966 (2022).
Tabarestani, S. et al. A tensorized multitask deep learning network for progression prediction of Alzheimer’s disease. Front. Aging Neurosci. 14, 810873. https://doi.org/10.3389/fnagi.2022.810873 (2022).
Tabarestani, S., et al. Longitudinal prediction modeling of Alzheimer disease using recurrent neural networks. 2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI). pp 1–4. https://doi.org/10.1109/BHI.2019.8834556 (2019).
Liu, M., Zhang, J., Lian, C. & Shen, D. Weakly supervised deep learning for brain disease prognosis using MRI and incomplete clinical scores. IEEE Trans. Cybern. 50, 3381–3392. https://doi.org/10.1109/TCYB.2019.2904186 (2020).
Zhang, D., Shen, D., Alzheimer’s Disease Neuroimaging Initiative. Predicting future clinical changes of MCI patients using longitudinal and multimodal biomarkers. PLOS ONE 7, e33182. https://doi.org/10.1371/journal.pone.0033182 (2012).
Liu, M., Zhang, J., Adeli, E. & Shen, D. Deep Multi-task Multi-channel learning for joint classification and regression of brain status. Medical image computing and computer-assisted intervention: MICCAI International Conference on Medical Image Computing and Computer-Assisted Intervention. 10435, 3–11, https://doi.org/10.1007/978-3-319-66179-7_1 (2017).
Tangaro, S., Fanizzi, A., Amoroso, N. & Bellotti, R. A fuzzy-based system reveals Alzheimer’s Disease onset in subjects with Mild Cognitive Impairment. Phys. Med. 38, 36–44. https://doi.org/10.1016/j.ejmp.2017.04.027 (2017).
Shu, Z.-Y. et al. Prediction of the progression from mild cognitive impairment to Alzheimer’s disease using a radiomics-integrated model. Ther. Adv. Neurol. Disord. 14, 17562864211029552. https://doi.org/10.1177/17562864211029551 (2021).
Lin, W. et al. Predicting Alzheimer’s disease conversion from mild cognitive impairment using an extreme learning machine-based grading method with multimodal data. Front. Aging Neurosci. https://doi.org/10.3389/fnagi.2020.00077 (2020).
Ezzati, A. et al. Optimizing machine learning methods to improve predictive models of Alzheimer’s disease. J. Alzheimer’s Dis. 71, 1027–1036. https://doi.org/10.3233/JAD-190262 (2019).
Gaser, C. et al. BrainAGE in mild cognitive impaired patients: Predicting the conversion to Alzheimer’s disease. PLOS ONE 8, e67346. https://doi.org/10.1371/journal.pone.0067346 (2013).
Llano, D. A., Laforet, G. & Devanarayan, V. Derivation of a new ADAS-cog composite using tree-based multivariate analysis: Prediction of conversion from mild cognitive impairment to Alzheimer disease. Alzheimer Dis. Assoc. Disord. https://doi.org/10.1097/WAD.0b013e3181f5b8d8 (2011).
Lee, G. et al. Predicting Alzheimer’s disease progression using multi-modal deep learning approach. Sci. Rep. 9, 1952. https://doi.org/10.1038/s41598-018-37769-z (2019).
Spasov, S., Passamonti, L., Duggento, A., Liò, P. & Toschi, N. A parameter-efficient deep learning approach to predict conversion from mild cognitive impairment to Alzheimer’s disease. NeuroImage 189, 276–287. https://doi.org/10.1016/j.neuroimage.2019.01.031 (2019).
Lu, D. et al. Multimodal and multiscale deep neural networks for the early diagnosis of Alzheimer’s disease using structural MR and FDG-PET images. Sci. Rep. 8, 5697. https://doi.org/10.1038/s41598-018-22871-z (2018).
Guan, H., Wang, C. & Tao, D. MRI-based Alzheimer’s disease prediction via distilling the knowledge in multi-modal data. NeuroImage 244, 118586. https://doi.org/10.1016/j.neuroimage.2021.118586 (2021).
Prabhu, S. S. et al. Multi-modal deep learning models for Alzheimer’s disease prediction using MRI and EHR. 2022 IEEE 22nd International Conference on Bioinformatics and Bioengineering (BIBE). 168–173, https://doi.org/10.1109/BIBE55377.2022.00044 (2022).
Ismail, W. N. et al. A meta-heuristic multi-objective optimization method for Alzheimer’s disease detection based on multi-modal data. Mathematics 11(4), 957. https://doi.org/10.3390/math11040957 (2023).
Gu, Z. et al. CE-Net: Context encoder network for 2D medical image segmentation. IEEE Trans. Med. Imaging. 38, 2281–2292. https://doi.org/10.1109/TMI.2019.2903562 (2019).
Hu, J., Shen, L., Albanie, S., Sun, G. & Wu, E. Squeeze-and-excitation networks. IEEE Trans. Pattern Anal. Mach. Intell. 42, 2011–2023. https://doi.org/10.1109/TPAMI.2019.2913372 (2020).
Payan, A. & Montana, G. Predicting Alzheimer’s disease: A neuroimaging study with 3D convolutional neural networks. ICPRAM 2015 - 4th International Conference on Pattern Recognition Applications and Methods, Proceedings. 2, (2015).
Zhang, T. & Shi, M. Multi-modal neuroimaging feature fusion for diagnosis of Alzheimer’s disease. J. Neurosci. Methods. 341, 108795. https://doi.org/10.1016/j.jneumeth.2020.108795 (2020).
Minhas, S. et al. Early MCI-to-AD conversion prediction using future value forecasting of multimodal features. Comput. Intell. Neurosci. 2021, 6628036. https://doi.org/10.1155/2021/6628036 (2021).
Galasko, D. R., Gould, R. L., Abramson, I. S. & Salmon, D. P. Measuring cognitive change in a cohort of patients with Alzheimer’s disease. Stat. Med. 19, 1421–1432. https://doi.org/10.1002/(SICI)1097-0258(20000615/30)19:11/12%3c1421::AID-SIM434%3e3.0.CO;2-P (2000).
Hinrichs, C., Singh, V., Xu, G. & Johnson, S. C. Predictive markers for AD in a multi-modality framework: An analysis of MCI progression in the ADNI population. NeuroImage 55, 574–589. https://doi.org/10.1016/j.neuroimage.2010.10.081 (2011).
Lian, C., Liu, M., Pan, Y. & Shen, D. Attention-guided hybrid network for dementia diagnosis with structural MR images. IEEE Trans. Cybern. 52, 1992–2003. https://doi.org/10.1109/TCYB.2020.3005859 (2022).
Duc, N. T. et al. 3D-deep learning based automatic diagnosis of Alzheimer’s disease with joint MMSE prediction using resting-state fMRI. Neuroinformatics 18, 71–86. https://doi.org/10.1007/s12021-019-09419-w (2020).
Yang, J. & Li, J. Application of deep convolution neural network. 2017 14th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP). 229–232, https://doi.org/10.1109/ICCWAMTIP.2017.8301485 (2017).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp 770–778, https://doi.org/10.1109/CVPR.2016.90 (2016).
Qiao, H., Chen, L. & Zhu, F. Ranking convolutional neural network for Alzheimer’s disease mini-mental state examination prediction at multiple time-points. Comput. Methods Progr. Biomed. 213, 106503. https://doi.org/10.1016/j.cmpb.2021.106503 (2022).
Hridayami, P., Putra, I. & Wibawa, K. Fish species recognition using VGG16 Deep convolutional neural network. J. Comput. Sci. Eng. 13, 124–130. https://doi.org/10.5626/JCSE.2019.13.3.124 (2019).
Murdaca, G. et al. Vitamin D and folate as predictors of MMSE in Alzheimer’s disease: A machine learning analysis. Diagnostics 11(6), 940. https://doi.org/10.3390/diagnostics11060940 (2021).
Huang, H. et al. Voxel-based morphometry and a deep learning model for the diagnosis of early Alzheimer’s disease based on cerebral gray matter changes. Cereb. Cortex 33(3), 754–763. https://doi.org/10.1093/cercor/bhac099 (2023).
Memiş, S., Enginoğlu, S. & Erkan, U. Fuzzy parameterized fuzzy soft k-nearest neighbor classifier. Neurocomputing 500, 351–378. https://doi.org/10.1016/j.neucom.2022.05.041 (2022).
Funding
This study was funded by the National Natural Science Foundation of China (Nos. 61971275, 81830052, and 82072228), the grants of the National Key Research and Development Program of China (2020YFC2008700) and Shanghai Municipal Commission of Science and Technology for Capacity Building for Local Universities (23010502700). The data collection and sharing for the project were funded by the ADNI (National Institutes of Health Grant No. U01 AG024904).
Author information
Authors and Affiliations
Contributions
Z.Y. and X.L. conceptualized and designed the methodology. Z.Y., X.L., Z.H. and Z.T. conducted the investigation. X.Y. and T.W. administered the project. Z.Y. and X.L. developed AI, including creating algorithms, learning, and visualization. X.L., Z.H. and Z.T. validated the results. Z.Y. and X.L. performed the statistical analysis. X.Y. provided supervision. X.Y. acquired funding. All authors contributed to writing the original draft, reviewed the manuscript, and approved the final version before submission.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yuan, Z., Li, X., Hao, Z. et al. Intelligent prediction of Alzheimer’s disease via improved multifeature squeeze-and-excitation-dilated residual network. Sci Rep 14, 11994 (2024). https://doi.org/10.1038/s41598-024-62712-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-62712-w
This article is cited by
-
AlzONet: a deep learning optimized framework for multiclass Alzheimer’s disease diagnosis using MRI brain imaging
The Journal of Supercomputing (2025)