Intelligent prediction of Alzheimer’s disease via improved multifeature squeeze-and-excitation-dilated residual network

Yuan, Zengbei; Li, Xinlin; Hao, Zezhou; Tang, Zhixian; Yao, Xufeng; Wu, Tao

doi:10.1038/s41598-024-62712-w

Download PDF

Article
Open access
Published: 25 May 2024

Intelligent prediction of Alzheimer’s disease via improved multifeature squeeze-and-excitation-dilated residual network

Zengbei Yuan^1,2^na1,
Xinlin Li^1,2^na1,
Zezhou Hao^1,2,
Zhixian Tang¹,
Xufeng Yao¹ &
…
Tao Wu¹

Scientific Reports volume 14, Article number: 11994 (2024) Cite this article

1554 Accesses
2 Citations
10 Altmetric
Metrics details

Subjects

Abstract

This study aimed to address the issue of larger prediction errors existing in intelligent predictive tasks related to Alzheimer’s disease (AD). A cohort of 487 enrolled participants was categorized into three groups: normal control (138 individuals), mild cognitive impairment (238 patients), and AD (111 patients) in this study. An improved multifeature squeeze-and-excitation-dilated residual network (MFSE-DRN) was proposed for two important AD predictions: clinical scores and conversion probability. The model was characterized as three modules: squeeze-and-excitation-dilated residual block (SE-DRB), multifusion pooling (MF-Pool), and multimodal feature fusion. To assess its performance, the proposed model was compared with two other novel models: ranking convolutional neural network (RCNN) and 3D vision geometrical group network (3D-VGGNet). Our method showed the best performance in the two AD predicted tasks. For the clinical scores prediction, the root-mean-square errors (RMSEs) and mean absolute errors (MAEs) of mini-mental state examination (MMSE) and AD assessment scale–cognitive 11-item (ADAS-11) were 1.97, 1.46 and 4.20, 3.19 within 6 months; 2.48, 1.69 and 4.81, 3.44 within 12 months; 2.67, 1.86 and 5.81, 3.83 within 24 months; 3.02, 2.03 and 5.09, 3.43 within 36 months, respectively. At the AD conversion probability prediction, the prediction accuracies within 12, 24, and 36 months reached to 88.0, 85.5, and 88.4%, respectively. The AD predication would play a great role in clinical applications.

Screening and predicting progression from high-risk mild cognitive impairment to Alzheimer’s disease

Article Open access 02 September 2021

The clinical significance of an AI-based assumption model for neurocognitive diseases using a novel dual-task system

Article Open access 22 April 2025

Exploring the relationship among Alzheimer’s disease, aging and cognitive scores through neuroimaging-based approach

Article Open access 10 November 2024

Introduction

Alzheimer’s disease (AD) is a type of common neurodegenerative disease that generally accompanies with progressive impairments in human cognition, behavior, and analytical skills¹. AD has a prevalence of 4.4% in people older than 65 years, and its treatment cost has exceeded 300 billion dollars worldwide^2,3. Early prediction of AD could effectively help to intervene the progression in time and improve the outcome of treatment⁴. Recently, the intelligent prediction of AD via deep learning (DL) algorithms has brought new opportunities for early diagnosis of AD⁵.

During the progression of AD, mild cognitive impairment (MCI) is regarded as the critical transition period from normal to AD. Patients with MCI have been reported to be 10 times more likely than normal individuals to progress to AD, and about 70% of MCI patients deteriorate to AD within 5 years. MCI is generally classified into two subtypes: progressive MCI (pMCI) and stable MCI (sMCI), based on the progression of the disease. Timely intervention in the MCI stage has shown potential in delaying the prognosis of the disease and significantly improve the prognosis of patients. Therefore, accurate prediction of AD conversion probability at the MCI stages has garnered significant interest among clinical researchers⁶.

In clinical practice, the diagnostic methods of AD mainly include neuropsychological testing, neuroimaging, genetic testing, and biomarker testing. A series of neuropsychological tests, such as mini-mental state examination (MMSE), AD assessment scale-cognitive 11-item (ADAS-11), and so forth, played a crucial role in AD diagnosis and provided an in-depth insight into the state of cognitive functioning^7,8. The neuroimaging modalities of magnetic resonance imaging (MRI) and positron emission tomography (PET) have been widely used in evaluating AD. For accurate quantitative evaluation of the structural changes in the brain along with AD progress, structural magnetic resonance imaging (sMRI) has become the most popular imaging technique in AD evaluation⁹. Thus far, several susceptibility genes have been identified for AD include amyloid precursor protein, presenilin 1, presenilin 2, and apolipoprotein E (ApoE). Among these genes, ApoE is closely correlated with AD occurrence. In addition, the biomarker testing was applied to monitor the state of AD progression by measuring the concentration of specific antibodies and proteins, including Aβ1-41, Aβ42/Aβ40, total tau protein (t-tau), and hyperphosphorylated tau protein (p-tau) in the cerebrospinal fluid or blood^10,11. Therefore, with the multimodality clinical data, the deduced features of sMRI, genes, biomarkers, and clinical scores have been used for AD predication^{12,13,14,15,16}.

In the literature, traditional ML and DL algorithms have made great progress in the AD classification. Once, Arafa et al.¹⁷ used a traditional CNN model for the classification of mild-dementia and non-dementia with an accuracy of 99.99%. And Fathi et al.¹⁸ applied six CNN classifiers to form an integrated model with up to 93.92% accuracy. Then, Helaly et al.¹⁹ used the traditional CNN model, the classification accuracies for 2D and 3D data of AD were 93.61 and 95.17%, respectively, and VGG19 models for transfer learning with up to 97% accuracy. Notably, Shankar et al.²⁰ developed a novel hierarchical residual attention learning-inspired multistage conjoined twin network (HRAL-CTNN) and got an classification accuracy of 99.97%.

Besides, two crucial AD prediction tasks have been identified: clinical scores and conversion probability. For AD intelligent prediction, alterations in clinical scores and conversion probability at different time periods have been investigated by the traditional machine learning (ML) and DL algorithms.

It was reported that traditional ML algorithms were proposed for predicting clinical scores. Zhou et al.¹⁶ proposed a convex fused sparse group least absolute shrinkage and selection operator (LASSO) model using MRI and genetic and demographic features to predict clinical scores, and eventually obtained the root-mean-square errors (RMSEs) of 2.737 and 5.678 for MMSE and ADAS within 12 months, respectively. Then, Lei et al.²¹ used MRI and clinical score features to construct a feature selection model of LASSO and correlation entropy to predict MMSE within 18 months, and achieved a mean absolute error (MAE) of 1.74. Again, Tabarestani et al.²² applied random forest from PET, MRI, and genetic and clinical score features for MMSE prediction within 24 months, with an RMSE of 3.15.

Notably, the DL models achieved robust performance in clinical scores prediction. Tabarestani et al.²³ used PET, MRI, clinical scores, and genetic and fluid biomarker features to construct a long short-term memory model to predict MMSE. They obtained an RMSE of 1.97 within 12 months. Liu et al.²⁴ proposed a weakly supervised densely connected neural network model using combined features of MRI and clinical scores, and eventually obtained the prediction RMSEs of 3.408 and 7.451 for MMSE and ADAS within 24 months, respectively. Additionally, Zhang et al.²⁵ used a sparse linear regression model combined with MRI, PET and clinical scoring features to predict the RMSE of 2.035 within 24 months for MMSE. Furthermore, Liu et al.²⁶ proposed a deep multitask multichannel learning (DM2L) model with MRI and demographic features to predict ADAS with a RMSE of 6.2 within 36 months.

For the AD conversion probability prediction, a series of traditional ML algorithms have been addressed. Tangaro et al.²⁷ proposed a support vector machine (SVM) based fuzzy class algorithm model using multifeatures of the hippocampal changes and clinical scores, and obtained an accuracy (ACC) of 83.4% for AD conversion probability within 12 months. Moreover, Shu et al.²⁸ used an integrated ML model to combine MRI, clinical scores and genetic features, and got a conversion probability within 12 months with an area under the curve (AUC) of 0.814. Lin et al.²⁹ implemented an extreme learning machine with MRI, PET, and genetic and biological features to achieve an ACC of 83.8% for predicting AD conversion probability within 24 months. Furthermore, Ezzati et al.³⁰ applied an ensemble linear discriminant model using MRI and demographic and genetic features to obtain an ACC of 74.9% within 24 months. Also, Gaser et al.³¹ used a relevance vector machine model using clinical scores, hippocampus, and biomarker features to predict AD conversion probability within 36 months and obtained an ACC of 81%.

Accordingly, many DL algorithms have been proposed for AD conversion probability prediction. Recently, Llano et al.³² used MRI and clinical scores to predict the AD conversion probability using a tree-based multivariate model and obtained an AUC of 0.665 within 12 months. More recently, Lee et al.³³ applied a recurrent neural network model with MRI, demographics, fluid biomarkers, and genetic features for predicting AD conversion probability within 24 months and obtained an ACC of 75%. Spasov et al.³⁴ used MRI, clinical scores, and genetic and demographic features to construct a 3D divisible convolutional neural network for AD conversion probability prediction and got an ACC of 86% within 36 months. Then, Lu et al.³⁵ used deep neural networks using PET and sMRI features and achieved an ACC of 82.4% within 36 months for AD conversion probability prediction. Overall, the DL models have shown better capability in the two predicted tasks of clinical scores and conversion probability in AD. However, there are still challenges to overcome, as large prediction errors and low accuracy remain a concern.

To resolve these problems, the 3D multifeature squeeze-and-excitation-dilated residual network (MFSE-DRN) was proposed for predicting clinical scores and conversion probability in AD. The MFSE-DRN model incorporates three novel modules: squeeze-and-excitation-dilated residual block (SE-DRB), multifusion pooling (MF-Pool), and multimodal feature fusion. These modules enhance the model’s ability to extract multimodal features while maintaining stability to prevent overfitting. By implementing the MFSE-DRN model, we accomplished longitudinal predictions of clinical scores and conversion probability in AD using baseline data. This approach offers a fresh perspective for early intelligent diagnosis of AD, providing valuable insights for improved patient care and management.

Results

Clinical scores prediction

During the ablation experiments, the proposed model showed lower RMSE and MAE values than the control group for MMSE and ADAS-11 score prediction at 6, 12, 24, and 36 months (M06, M12, M24, and M36) (Table 1). As shown in Table 1, even after removing the added modules sequentially, the proposed model exhibited significantly lower RMSE and MAE values than the control group for MMSE and ADAS-11 score prediction at M06, M12, M24, and M36. Thus, the results of the ablation experiment validated the effectiveness of each added module in AD prediction.

Table 1 Ablation comparison of clinical scores prediction.

Full size table

Table 2 shows that the proposed model demonstrated best performance in the two tasks of predicting MMSE and ADAS-11 clinical scores and conversion probability within M06, M12, M24, and M36. In MMSE prediction, the proposed model had the lowest RMSEs of 1.97, 2.48, 2.67, and 3.02 and MAEs of 1.46, 1.69, 1.86, and 2.03 within M06, M12, M24, and M36, respectively. For ADAS-11 prediction, the proposed model had the lowest RMSEs of 4.20, 4.81, 5.81, and 5.09 and MAEs of 3.19, 3.44, 3.83, and 3.43 within M06, M12, M24, and M36, respectively.

Table 2 Performance comparison of clinical scores prediction.

Full size table

AD conversion probability

According to the ablation experiments, the proposed model obtained the highest predicted ACCs compared to ablation controls within M12, M24, and M36, respectively, as shown in Table 3. As depicted in Table 4, the proposed MFSE-DRN model exhibited superior capability in the AD conversion probability prediction with the highest ACCs of 88.0, 85.5, and 88.4% within M12, M24, and M36. As shown in Fig. 1, the AUCs of the proposed model reached to 0.74, 0.90, and 0.94 for AD conversion probability within M12, M24, and M36, respectively.

Table 3 Ablation comparison of AD conversion probability (ACC %).

Full size table

Table 4 Performance comparison of AD conversion probability prediction (%).

Full size table

Discussion

In clinical practice, accurate predictions of clinical scores and conversion probability in AD play a crucial role in understanding the rate of disease progression and tailoring individualized treatment plans. The longitudinal prediction of AD demonstrated significant values in evaluating the progression of AD and preventing its onset. In this study, the proposed model exhibited robust performance in AD predictions of clinical scores and conversion probability. These findings highlight the potential of the model for aiding clinicians in making informed decisions and improving patient outcomes. This is expected to be applied to clinical research.

The combined features of sMRI, genetics, clinical scores, and biomarkers were confirmed to have high sensitivity and specificity for AD. The results showed that the fusion of multiple features ensured that the vital features were acquired during feature extraction and improved the model robustness. It was consistent with the previous findings that various types of data enhanced the model’s ability to learn more complementary information during training^36,37,38. Therefore, this study utilized multiple types of baseline data to betterment the effectiveness of the model. Due to the absence of the follow-up data for the CDRSB and RAVLT immediate scores of the participants in the data set, it was worthy of noting that our study did not predict these two clinical scores.

Different from the existing AD prediction models, our proposed model did have substantial innovations in three aspects. Firstly, the construction of the DRB could extract internal core characteristics of sMRI facilitating efficient model³⁹. Secondly, the incorporation of the SE structure enhanced the DRB’s ability to accurately capture feature correlations and maintain model stability when handing high-dimensional data⁴⁰. Thirdly, the inclusion of both max pooling and average pooling layers in the multifusion pooling (MF-Pool) block enhanced the model’s capability to extract and delineate nonlinear features while also reducing dimensionality. This enabled the neural network to effectively extract high-dimensional features. As shown in previous studies, the use of multimodal features provides abundant feature information to precisely reflect the pathological process of AD and thus enhances the performance of the model to accurately predict AD^41,42,43.

Our experimental results displayed that the prediction RMSEs and MAEs of clinical scores gradually increased along with the follow-up time points. This means that the prediction of clinical scores with typical brain morphology is more accurate in the early stages. This is because the hippocampal and internal olfactory cortical brain regions show a marked tendency to atrophy in the early stages of AD¹⁶. Similarly, it was validated that those cortices were highly correlated with the MMSE and ADAS in our experiment. Subsequently, the prediction error increased along with the trend of slow cognitive decline in the later stages of AD^24,44. According to the AD conversion probability, there existed obvious trends of increased ACCs and AUCs along with the varied predicted time points. It was due to the fact that the significant changes of brain regions produced at the later AD stages would help to improve the predication accuracy⁴⁵.

This study still had some limitations. Firstly, the enrolled samples from the public data set were very limited, this increased the likelihood of individual influence during data collection and might have even led to overfitting during the predicted tasks. Then, since no other available public data sets were provided, the validation of our model would resort to private and multi-center data sets. Finally, only one type of imaging modality of sMRI was used in our study, and the application of multiple imaging modalities of MR and/or PET is still worthy of trying and evaluating.

In future, more cases with multi-modal data should be involved to improve the performance of predictions. In addition, more kinds of DL strategies such as transfer learning and reinforcement learning should be integrated to improve the accuracy, and reliability of the models for AD prediction^46,47. Especially, the interpretability for proposed model would be investigated to improve the visualization and analysis of the features for the hidden layers.

Materials and methods

Materials

Enrolled participants

A total of 487 participants was enrolled from the Alzheimer’s disease neuroimaging initiative (ADNI) database (www.adni-info.org), including 111 patients with AD, 238 patients with MCI, and 138 normal controls. The subjects are primarily non-Hispanic white subjects. All enrolled participants met with the criteria of the national institute of neurological and communicative disorders and stroke/AD and related disorders association.

Data description

The data sets of sMRI, clinical scores (MMSE, ADAS-11, clinical dementia rating scale sum of boxes (CDRSB), and rey auditory verbal learning test (RAVLT) immediate, genetics (ApoE4), and biomarker features (Aβ42/Aβ40 antibodies, t-tau, and p-tau) at baseline were collected for each participant. And the demographic information of the enrolled participants is presented in Table 5. The chi-square tests validated that the variables such as sex, age, education years, MMSE, ADAS-11, CDRSB, and RAVLT immediate were individually independent and conformed to normal distribution among the three groups. Meanwhile, the clinical scores of MMSE and ADAS-11 at longitudinal time points of M06, M12, M24, and M36 were also collected for model evaluation, as shown in Table 6. The conversion cases of MCI groups were divided into two subgroups, pMCI and sMCI, as depicted in Table 7.

Table 5 The demographic information of enrolled subjects.

Full size table

Table 6 Clinical scores cases at different time points.

Full size table

Table 7 AD conversion cases in MCI group.

Full size table

All participants underwent MR scanning of the brain, which were acquired using 3 T scanners from Siemens, general electric (GE), and Philips at multiple sites. T1-weighted images were acquired, and the protocol parameters were listed as follows: 1.2 mm slice thickness; 256 × 256 scanning matrix; repetition time = 2300 ms; echo time = 2.98 ms; field of view = 240 × 240 mm²; flip angle = 90 degree; and 256 × 256 reconstruction matrix.

Methods

The pipeline of the multifeature AD prediction included three steps: data preprocessing, model construction, and experimental setup (Fig. 2). Firstly, sMRI, genetic features, biological features and clinical information were individually preprocessed; secondly, the MFSE-DRN using the blocks of SE-DRB, MF-Pool and multi-feature fusion was constructed; and finally, the clinical scores and AD conversion probabilities were predicted.

Data preprocessing

The data prepossessing contained two aspects: one for the sMRI data set, and another for clinical scores, genetic and biomarker data sets. The sMRI data preprocessing consisted of three steps: image registration, non-brain tissue segmentation, and image enhancement. It was performed using the computational anatomy toolbox 12 (CAT12) software package (https://www.nitrc.org/projects/cat/). Firstly, the sMRI images were registered using the diffeomorphic anatomical registration through exponentiated lie algebra (DARTEL) method to ensure brain structure alignment across individuals. Secondly, non-brain tissues, such as the skull and scalp, were removed using morphology-based methods. Thirdly, the image enhancement was achieved through histogram equalization, spatial filtering, and nonlinear transformation methods. Besides, the raw data sets of clinical scores, genes, and biomarkers were normalized due to the varied magnitude scales.

Model construction of MFSE-DRN

Based on the classical residual network (DRN), our modified model integrated three key modules of SE-DRB, MF-Pool, and multifeature fusion. As illustrated in Fig. 3, our model comprised of 36 layers, including 32 convolutional (Conv) layers (16 SE-DRBs), one 7 × 7 × 7 Conv layer, 2 MF-Pool layers, and one multifeature fusion layer. The data pipeline of MFSE-DRN is represented as pseudo-code, as shown in Table 8.

Table 8 MFSE-DRN algorithm.

Full size table

The SE-DRB was assembled by dilated convolution (DC) and squeeze-and-excitation (SE) blocks and enlarged the receptive convolution field for feature extraction. Then, the MF-Pool combined the max pooling and average pooling layers for extracting varying dimensions features. Furthermore, the multifeature fusion could effectively handle high-dimensional data. It concatenated concurrent features different levels of multiple features to extract the most representative features, and utilized the correlation and complementarity between different features to improve the accuracy of prediction.

The pipeline of our model was elucidated as follows: Firstly, the sMRI data was downsampled using a 7 × 7 × 7 Conv layer and a 3 × 3 × 3 MF-Pool layer. Next, the high-dimensional image features were extracted by the 16 SE-DRBs and outputted to the MF-Pool layer. At last, the image features were integrated with genetic features, clinical scores, and biological features, and the multi-features were connected with the fully connected (FC) layer. Since the proposed MF-Pool combined the max pooling and average pooling layers, and the capability of feature extraction at varied dimensions was effectively enhanced⁴⁸. The final output of the MF-Pool is formulated as follows:

$$Y\left( {n,c,h,\omega } \right) = \frac{1}{2}\left[ {Y_{1} \left( {n,c,h,\omega } \right) + Y_{2} \left( {n,c,h,\omega } \right)} \right]$$

(1)

$$Y_{1} \left( {n,c,h,\omega } \right) = \mathop {max}\limits_{{m \in \left[ {0,k_{h} - 1\left] {,n \in } \right[0,k_{\omega } - 1} \right]}} X\left( {n,c,hs + m,\omega s + n} \right)$$

(2)

$$Y_{2} \left( {n,c,h,\omega } \right) = \frac{1}{{k_{h} \times k_{\omega } }}\mathop \sum \limits_{m = 0}^{{k_{h} - 1}} \mathop \sum \limits_{n = 0}^{{k_{\omega } - 1}} X\left( {n,c,hs + m,ws + n} \right)$$

(3)

where Y is the final output of the MF-Pool; Y₁ is the output of the maximum pooling layer; Y₂ is the output of the average pooling layer; n is the number of feature maps; c is the number of channels; h is the number of rows; ω is the number of columns; s is the step size; k_h is the pooling window length; k_ω is the pooling window width.

With the improved DRB, it facilitated the feature extraction capability for additional features by enlarging the Conv receptive field in residual networks. In our model, the DRB was constructed by replacing the first layer of the base convolution in the base residual module. Here, the base convolution kernel k = 3, DC expansion rate d = 2, equivalent convolution kernel k’ = 5, and current sensory field RF_i+1 = 7.

Additionally, the use of residual structure in our network architecture can allow for deeper and more complex models to be trained, and the short-circuit connection can effectively avoid the overfitting problem caused by the model depth⁴⁹. In order to improve the model stability, the SE-DRB was built by integrating the SE with the DRB, as illustrated in Fig. 4. This made a more precise capture of feature correlations by assigning adaptive weights to features across various channels, thus significantly elevating the network performance.

In Fig. 5, the multimodal feature fusion block combined the image features extracted from the Conv layer with the genes, clinical scores, and biometric features, and acted on the FC layer to effectively extract multifeature information to achieve longitudinal prediction of AD.

Experimental setup

In this study, two AD predicted tasks of clinical scores and conversion probability were performed. The first was to predict the MMSE and ADAS-11 scores at M06, M12, M24, and M36, and the second task was to predict the conversion probability of patients with MCI progressed to AD at M12, M24, and M36, respectively.

For performance evaluation, our proposed model was quantitatively compared with two other novel models: ranking convolutional neural network (RCNN) and 3D vision geometrical group network (3D-VGGNet)^50,51. The ablation experiments were conducted by individually removing the three mentioned modules in the model, and five-fold cross-validation method with repeated random sampling is used.

During the experiments, The main hyperparameters were as follows: (1) during the clinical scores prediction, the number of training rounds was 150, the batch size was 8, the learning rate was 0.0001, the loss function was mean square error (MSE), and the optimizer was chosen as Adam; and (2) at the AD conversion probability prediction, the number of training rounds was 100, the batch size was 4, the learning rate was 0.0001, the loss function was Cross Entropy, and the Adam was chosen as optimizer. The regularization terms of L1 and L2 were implemented. Here, L1 produces a sparse matrix of weights and L2 allows the weights to decrease uniformly. This would ensure the model stability and avoid model overfitting.

Moreover, two metrics of MAE and RMSE were chosen for evaluating clinical scores prediction⁵², and five indices of ACC, precision, recall, F1, receiver operating characteristic, and AUC⁵³ were defined to assess AD conversion probability prediction⁵⁴.

The DL experiments were performed on a Windows 10 Pro workstation using an NVIDIA RTX 6000 graphics card. The CUDA version 11.3 and Python version 3.8 software were used with the DL framework of PyTorch.

Ethical approval

The present study was approved by The Shanghai University of Medicine and Health Sciences ethics review committee (approval number, 2019-GZR-06-142132197606243519). All research methods were conducted in strict accordance with relevant guidlines and regulations. We hereby confirm that informed consent was obtained from all subjects and/or their legal guardians who provided data.

Data availability

The datasets generated or analyzed during the study are available in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) repository, https://adni.loni.usc.edu/.

References

Sharma, R., Goel, T., Tanveer, M., Dwivedi, S. & Murugan, R. FAF-DRVFL: Fuzzy activation function based deep random vector functional links network for early diagnosis of Alzheimer disease. Appl. Soft Comput. 106, 107371. https://doi.org/10.1016/j.asoc.2021.107371 (2021).
Article Google Scholar
Hao, N. et al. Acoustofluidic multimodal diagnostic system for Alzheimer’s disease. Biosens. Bioelectron. 196, 113730. https://doi.org/10.1016/j.bios.2021.113730 (2022).
Article CAS PubMed Google Scholar
Niu, H., Álvarez-Álvarez, I., Guillén-Grima, F. & Aguinaga-Ontoso, I. Prevalence and incidence of Alzheimer’s disease in Europe: A meta-analysis. Neurología (English Edition) 32, 523–532. https://doi.org/10.1016/j.nrleng.2016.02.009 (2017).
Article CAS Google Scholar
Liss, J. et al. Practical recommendations for timely, accurate diagnosis of symptomatic Alzheimer’s disease (MCI and Dementia) in primary care: A review and synthesis. J. Intern. Med. https://doi.org/10.1111/joim.13244 (2021).
Article PubMed PubMed Central Google Scholar
Frederiksen, K. S., Gjerum, L., Waldemar, G. & Hasselbalch, S. G. Effects of physical exercise on Alzheimer’s disease biomarkers: A systematic review of intervention studies. J Alzheimers Dis. 61, 359–372. https://doi.org/10.3233/jad-170567 (2018).
Article CAS PubMed Google Scholar
Shen, T. et al. Decision supporting model for one-year conversion probability from MCI to AD using CNN and SVM. in 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). 738–741, https://doi.org/10.1109/EMBC.2018.8512398 (2018).
Dos Santos Picanco, L. C. et al. Alzheimer’s disease: A review from the pathophysiology to diagnosis, new perspectives for pharmacological treatment. Curr. Med. Chem. 25(26), 3141–3159. https://doi.org/10.2174/0929867323666161213101126 (2018).
Article CAS PubMed Google Scholar
Huang, L., Jin, Y., Gao, Y., Thung, K.-H. & Shen, D. Longitudinal clinical score prediction in Alzheimer’s disease with soft-split sparse regression based random forest. Neurobiol. Aging 46, 180–191. https://doi.org/10.1016/j.neurobiolaging.2016.07.005 (2016).
Article PubMed PubMed Central Google Scholar
Simon, M. J. & Iliff, J. J. Regulation of cerebrospinal fluid (CSF) flow in neurodegenerative, neurovascular and neuroinflammatory disease. Biochimica et Biophysica Acta Mol. Basis Dis. 1862, 442–451. https://doi.org/10.1016/j.bbadis.2015.10.014 (2016).
Article CAS Google Scholar
Bell, R. D. et al. Apolipoprotein E controls cerebrovascular integrity via cyclophilin A. Nature 485, 512–516. https://doi.org/10.1038/nature11087 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Uddin, M. S. et al. APOE and Alzheimer’s disease: Evidence mounts that targeting APOE4 may combat Alzheimer’s pathogenesis. Mol. Neurobiol. 56, 2450–2465. https://doi.org/10.1007/s12035-018-1237-z (2019).
Article CAS PubMed Google Scholar
Pozueta, A. et al. Detection of early Alzheimer’s disease in MCI patients by the combination of MMSE and an episodic memory test. BMC Neurol. 11, 78. https://doi.org/10.1186/1471-2377-11-78 (2011).
Article PubMed PubMed Central Google Scholar
Zhang, X. et al. Metrological properties of neuropsychological tests for measuring cognitive change in individuals with prodromal Alzheimer’s disease. Aging Mental Health 26, 1988–1996. https://doi.org/10.1080/13607863.2021.1966746 (2022).
Article PubMed Google Scholar
Grundman, M. et al. Mild cognitive impairment can be distinguished from Alzheimer disease and normal aging for clinical trials. Arch. Neurol. 61, 59–66. https://doi.org/10.1001/archneur.61.1.59 (2004).
Article PubMed Google Scholar
Ito, K., Hutmacher, M. M. & Corrigan, B. W. Modeling of Functional Assessment Questionnaire (FAQ) as continuous bounded data from the ADNI database. J. Pharmacokinet. Pharmacodyn. 39, 601–618. https://doi.org/10.1007/s10928-012-9271-3 (2012).
Article CAS PubMed Google Scholar
Zhou, J., Liu, J., Narayan, V. A. & Ye, J. Modeling disease progression via multi-task learning. NeuroImage 78, 233–248. https://doi.org/10.1016/j.neuroimage.2013.03.073 (2013).
Article PubMed Google Scholar
Arafa, D. A. et al. A deep learning framework for early diagnosis of Alzheimer’s disease on MRI images. Multimed. Tools Appl. 83, 3767–3799. https://doi.org/10.1007/s11042-023-15738-7 (2024).
Article Google Scholar
Fathi, S. et al. A deep learning-based ensemble method for early diagnosis of Alzheimer’s disease using MRI images. Neuroinformatics 22(1), 89–105. https://doi.org/10.1007/s12021-023-09646-2 (2024).
Article PubMed Google Scholar
Helaly, H. A., Badawy, M. & Haikal, A. Y. Deep learning approach for early detection of Alzheimer’s disease. Cogn. Comput. 14(5), 1711–1727. https://doi.org/10.1007/s12559-021-09946-2 (2022).
Article Google Scholar
Shankar, V. G., Sisodia, D. S. & Chandrakar, P. An intelligent hierarchical residual attention learning-based conjoined twin neural network for Alzheimer’s stage detection and prediction. Comput. Intell. 39, 783–805. https://doi.org/10.1111/coin.12594 (2023).
Article Google Scholar
Lei, B. et al. Predicting clinical scores for Alzheimer’s disease based on joint and deep learning. Expert Syst. Appl. 187, 115966. https://doi.org/10.1016/j.eswa.2021.115966 (2022).
Article Google Scholar
Tabarestani, S. et al. A tensorized multitask deep learning network for progression prediction of Alzheimer’s disease. Front. Aging Neurosci. 14, 810873. https://doi.org/10.3389/fnagi.2022.810873 (2022).
Article CAS PubMed PubMed Central Google Scholar
Tabarestani, S., et al. Longitudinal prediction modeling of Alzheimer disease using recurrent neural networks. 2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI). pp 1–4. https://doi.org/10.1109/BHI.2019.8834556 (2019).
Liu, M., Zhang, J., Lian, C. & Shen, D. Weakly supervised deep learning for brain disease prognosis using MRI and incomplete clinical scores. IEEE Trans. Cybern. 50, 3381–3392. https://doi.org/10.1109/TCYB.2019.2904186 (2020).
Article PubMed Google Scholar
Zhang, D., Shen, D., Alzheimer’s Disease Neuroimaging Initiative. Predicting future clinical changes of MCI patients using longitudinal and multimodal biomarkers. PLOS ONE 7, e33182. https://doi.org/10.1371/journal.pone.0033182 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Liu, M., Zhang, J., Adeli, E. & Shen, D. Deep Multi-task Multi-channel learning for joint classification and regression of brain status. Medical image computing and computer-assisted intervention: MICCAI International Conference on Medical Image Computing and Computer-Assisted Intervention. 10435, 3–11, https://doi.org/10.1007/978-3-319-66179-7_1 (2017).
Tangaro, S., Fanizzi, A., Amoroso, N. & Bellotti, R. A fuzzy-based system reveals Alzheimer’s Disease onset in subjects with Mild Cognitive Impairment. Phys. Med. 38, 36–44. https://doi.org/10.1016/j.ejmp.2017.04.027 (2017).
Article PubMed Google Scholar
Shu, Z.-Y. et al. Prediction of the progression from mild cognitive impairment to Alzheimer’s disease using a radiomics-integrated model. Ther. Adv. Neurol. Disord. 14, 17562864211029552. https://doi.org/10.1177/17562864211029551 (2021).
Article CAS PubMed PubMed Central Google Scholar
Lin, W. et al. Predicting Alzheimer’s disease conversion from mild cognitive impairment using an extreme learning machine-based grading method with multimodal data. Front. Aging Neurosci. https://doi.org/10.3389/fnagi.2020.00077 (2020).
Article PubMed PubMed Central Google Scholar
Ezzati, A. et al. Optimizing machine learning methods to improve predictive models of Alzheimer’s disease. J. Alzheimer’s Dis. 71, 1027–1036. https://doi.org/10.3233/JAD-190262 (2019).
Article Google Scholar
Gaser, C. et al. BrainAGE in mild cognitive impaired patients: Predicting the conversion to Alzheimer’s disease. PLOS ONE 8, e67346. https://doi.org/10.1371/journal.pone.0067346 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Llano, D. A., Laforet, G. & Devanarayan, V. Derivation of a new ADAS-cog composite using tree-based multivariate analysis: Prediction of conversion from mild cognitive impairment to Alzheimer disease. Alzheimer Dis. Assoc. Disord. https://doi.org/10.1097/WAD.0b013e3181f5b8d8 (2011).
Article PubMed Google Scholar
Lee, G. et al. Predicting Alzheimer’s disease progression using multi-modal deep learning approach. Sci. Rep. 9, 1952. https://doi.org/10.1038/s41598-018-37769-z (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Spasov, S., Passamonti, L., Duggento, A., Liò, P. & Toschi, N. A parameter-efficient deep learning approach to predict conversion from mild cognitive impairment to Alzheimer’s disease. NeuroImage 189, 276–287. https://doi.org/10.1016/j.neuroimage.2019.01.031 (2019).
Article PubMed Google Scholar
Lu, D. et al. Multimodal and multiscale deep neural networks for the early diagnosis of Alzheimer’s disease using structural MR and FDG-PET images. Sci. Rep. 8, 5697. https://doi.org/10.1038/s41598-018-22871-z (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Guan, H., Wang, C. & Tao, D. MRI-based Alzheimer’s disease prediction via distilling the knowledge in multi-modal data. NeuroImage 244, 118586. https://doi.org/10.1016/j.neuroimage.2021.118586 (2021).
Article PubMed Google Scholar
Prabhu, S. S. et al. Multi-modal deep learning models for Alzheimer’s disease prediction using MRI and EHR. 2022 IEEE 22nd International Conference on Bioinformatics and Bioengineering (BIBE). 168–173, https://doi.org/10.1109/BIBE55377.2022.00044 (2022).
Ismail, W. N. et al. A meta-heuristic multi-objective optimization method for Alzheimer’s disease detection based on multi-modal data. Mathematics 11(4), 957. https://doi.org/10.3390/math11040957 (2023).
Article Google Scholar
Gu, Z. et al. CE-Net: Context encoder network for 2D medical image segmentation. IEEE Trans. Med. Imaging. 38, 2281–2292. https://doi.org/10.1109/TMI.2019.2903562 (2019).
Article PubMed Google Scholar
Hu, J., Shen, L., Albanie, S., Sun, G. & Wu, E. Squeeze-and-excitation networks. IEEE Trans. Pattern Anal. Mach. Intell. 42, 2011–2023. https://doi.org/10.1109/TPAMI.2019.2913372 (2020).
Article PubMed Google Scholar
Payan, A. & Montana, G. Predicting Alzheimer’s disease: A neuroimaging study with 3D convolutional neural networks. ICPRAM 2015 - 4th International Conference on Pattern Recognition Applications and Methods, Proceedings. 2, (2015).
Zhang, T. & Shi, M. Multi-modal neuroimaging feature fusion for diagnosis of Alzheimer’s disease. J. Neurosci. Methods. 341, 108795. https://doi.org/10.1016/j.jneumeth.2020.108795 (2020).
Article PubMed Google Scholar
Minhas, S. et al. Early MCI-to-AD conversion prediction using future value forecasting of multimodal features. Comput. Intell. Neurosci. 2021, 6628036. https://doi.org/10.1155/2021/6628036 (2021).
Article PubMed PubMed Central Google Scholar
Galasko, D. R., Gould, R. L., Abramson, I. S. & Salmon, D. P. Measuring cognitive change in a cohort of patients with Alzheimer’s disease. Stat. Med. 19, 1421–1432. https://doi.org/10.1002/(SICI)1097-0258(20000615/30)19:11/12%3c1421::AID-SIM434%3e3.0.CO;2-P (2000).
Article CAS PubMed Google Scholar
Hinrichs, C., Singh, V., Xu, G. & Johnson, S. C. Predictive markers for AD in a multi-modality framework: An analysis of MCI progression in the ADNI population. NeuroImage 55, 574–589. https://doi.org/10.1016/j.neuroimage.2010.10.081 (2011).
Article PubMed Google Scholar
Lian, C., Liu, M., Pan, Y. & Shen, D. Attention-guided hybrid network for dementia diagnosis with structural MR images. IEEE Trans. Cybern. 52, 1992–2003. https://doi.org/10.1109/TCYB.2020.3005859 (2022).
Article PubMed PubMed Central Google Scholar
Duc, N. T. et al. 3D-deep learning based automatic diagnosis of Alzheimer’s disease with joint MMSE prediction using resting-state fMRI. Neuroinformatics 18, 71–86. https://doi.org/10.1007/s12021-019-09419-w (2020).
Article PubMed Google Scholar
Yang, J. & Li, J. Application of deep convolution neural network. 2017 14th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP). 229–232, https://doi.org/10.1109/ICCWAMTIP.2017.8301485 (2017).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp 770–778, https://doi.org/10.1109/CVPR.2016.90 (2016).
Qiao, H., Chen, L. & Zhu, F. Ranking convolutional neural network for Alzheimer’s disease mini-mental state examination prediction at multiple time-points. Comput. Methods Progr. Biomed. 213, 106503. https://doi.org/10.1016/j.cmpb.2021.106503 (2022).
Article Google Scholar
Hridayami, P., Putra, I. & Wibawa, K. Fish species recognition using VGG16 Deep convolutional neural network. J. Comput. Sci. Eng. 13, 124–130. https://doi.org/10.5626/JCSE.2019.13.3.124 (2019).
Article Google Scholar
Murdaca, G. et al. Vitamin D and folate as predictors of MMSE in Alzheimer’s disease: A machine learning analysis. Diagnostics 11(6), 940. https://doi.org/10.3390/diagnostics11060940 (2021).
Article CAS PubMed PubMed Central Google Scholar
Huang, H. et al. Voxel-based morphometry and a deep learning model for the diagnosis of early Alzheimer’s disease based on cerebral gray matter changes. Cereb. Cortex 33(3), 754–763. https://doi.org/10.1093/cercor/bhac099 (2023).
Article PubMed Google Scholar
Memiş, S., Enginoğlu, S. & Erkan, U. Fuzzy parameterized fuzzy soft k-nearest neighbor classifier. Neurocomputing 500, 351–378. https://doi.org/10.1016/j.neucom.2022.05.041 (2022).
Article Google Scholar

Download references

Funding

This study was funded by the National Natural Science Foundation of China (Nos. 61971275, 81830052, and 82072228), the grants of the National Key Research and Development Program of China (2020YFC2008700) and Shanghai Municipal Commission of Science and Technology for Capacity Building for Local Universities (23010502700). The data collection and sharing for the project were funded by the ADNI (National Institutes of Health Grant No. U01 AG024904).

Author information

These authors contributed equally: Zengbei Yuan and Xinlin Li.

Authors and Affiliations

College of Medical Imaging, Jiading District Central Hospital Affiliated Shanghai University of Medicine and Health Sciences, Shanghai, 201318, China
Zengbei Yuan, Xinlin Li, Zezhou Hao, Zhixian Tang, Xufeng Yao & Tao Wu
School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
Zengbei Yuan, Xinlin Li & Zezhou Hao

Authors

Zengbei Yuan
View author publications
Search author on:PubMed Google Scholar
Xinlin Li
View author publications
Search author on:PubMed Google Scholar
Zezhou Hao
View author publications
Search author on:PubMed Google Scholar
Zhixian Tang
View author publications
Search author on:PubMed Google Scholar
Xufeng Yao
View author publications
Search author on:PubMed Google Scholar
Tao Wu
View author publications
Search author on:PubMed Google Scholar

Contributions

Z.Y. and X.L. conceptualized and designed the methodology. Z.Y., X.L., Z.H. and Z.T. conducted the investigation. X.Y. and T.W. administered the project. Z.Y. and X.L. developed AI, including creating algorithms, learning, and visualization. X.L., Z.H. and Z.T. validated the results. Z.Y. and X.L. performed the statistical analysis. X.Y. provided supervision. X.Y. acquired funding. All authors contributed to writing the original draft, reviewed the manuscript, and approved the final version before submission.

Corresponding author

Correspondence to Xufeng Yao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yuan, Z., Li, X., Hao, Z. et al. Intelligent prediction of Alzheimer’s disease via improved multifeature squeeze-and-excitation-dilated residual network. Sci Rep 14, 11994 (2024). https://doi.org/10.1038/s41598-024-62712-w

Download citation

Received: 20 December 2023
Accepted: 21 May 2024
Published: 25 May 2024
DOI: https://doi.org/10.1038/s41598-024-62712-w

This article is cited by

AlzONet: a deep learning optimized framework for multiclass Alzheimer’s disease diagnosis using MRI brain imaging
- Hiba A. Alahmed
- Ghaida A. Al-Suhail
The Journal of Supercomputing (2025)