An international multi-cohort investigation of self-reported sleep and future depressive symptoms in older adults

Wallace, Meredith L.; Oryshkewych, Nina; Hoepel, Sanne J.W.; Buysse, Daniel J.; Mentch, Lucas; Butters, Meryl A.; Stone, Katie L.; Yaffe, Kristine; Barnes, Lisa L.; Lim, Andrew S.; Ensrud, Kristine E.; Paudel, Misti L.; Luik, Annemarie

doi:10.1038/s41598-025-07864-z

Download PDF

Original Research
Open access
Published: 04 July 2025

An international multi-cohort investigation of self-reported sleep and future depressive symptoms in older adults

Meredith L. Wallace^1,2,
Nina Oryshkewych³,
Sanne J.W. Hoepel⁴,
Daniel J. Buysse¹,
Lucas Mentch²,
Meryl A. Butters¹,
Katie L. Stone^5,12,
Kristine Yaffe⁶,
Lisa L. Barnes⁷,
Andrew S. Lim⁸,
Kristine E. Ensrud⁹,
Misti L. Paudel¹⁰ &
…
Annemarie Luik ORCID: orcid.org/0000-0001-7517-197X^7,11

Scientific Reports volume 15, Article number: 23918 (2025) Cite this article

83 Accesses
Metrics details

Subjects

Abstract

Poor subjective sleep is associated with future depression in older adults, but there is limited consensus on which sleep features have the strongest associations. Moreover, composite scores incorporating multiple features may better represent sleep burden than individual sleep items. We analyzed older adults (age ≥ 60) without clinically relevant depressive symptoms from a multi-cohort United States sample (US; N = 4826) and the Netherlands’ Rotterdam Study (RS; N = 3663), with the goal of identifying individual and composite sleep features that are associated with future clinically relevant depressive symptoms 3–6 years later. Sleep-related daytime symptoms (Risk Ratio [95% CI] 2.10 [1.58, 2.80] in US; 2.10 [1.40, 3.14] in RS) and difficulty falling asleep (1.87 [1.49, 2.35] in US; RS = 1.90 [1.50, 2.43] in RS) were the strongest individual sleep features. Moreover, the combination of these features was most impactful (3.32 [2.33, 4.73] in US; 3.19 [2.64, 3.86] in RS), providing the largest effect size with the fewest number of items. Future studies should assess whether screening tools incorporating these features, paired with targeted sleep treatment, could reduce rates of incident depression in older adults. Examining mechanisms underlying these associations could improve the effectiveness of sleep-related treatments in older adults.

Sleep habits are associated with cognition decline in physically robust, but not in frail participants: a longitudinal observational study

Article Open access 08 July 2022

Sleep characteristics across the lifespan in 1.1 million people from the Netherlands, United Kingdom and United States: a systematic review and meta-analysis

Article 16 November 2020

A multi-institutional cohort study on risk of sleep disorders in dry eyes patients using TriNetX

Article Open access 11 April 2025

Introduction

Depression is a leading cause of disability among older adults, yet the world’s aging population continues to have limited depression screening and prevention resources¹. Brief and scalable screening tools indicating which older adults are at risk for future depression, followed by interventions targeting depression-specific pathways, may be beneficial. Self-reported sleep characteristics could contribute meaningfully to such tools and interventions. Up to 90% of people with major depressive disorder report sleep-related symptoms² and studies link sleep to underlying depression pathophysiology³. Although sleep is known to have bi-directional associations with depression, experimental evidence suggests a potential causal relationship³. Among healthy individuals, experimental manipulation of sleep increases negative affect⁴, and among people with depression, clinical trials demonstrate that therapeutic sleep interventions reduce depressive symptoms⁵.

Despite the strong links between sleep and depression, the multi-dimensional nature of sleep is a barrier to determining which self-reported sleep features should be considered in future investigations aimed at enhancing screening and prevention tools. Insomnia symptoms and diagnoses can more than double the risk for future depression^3,6. The ‘insomnia with short sleep’ phenotype is reported to have a stronger association with depression than insomnia alone⁷, although often this phenotype is defined based on objective (rather than self-reported) sleep duration. Sleep apnea frequently co-occurs with depression and may also triple the risk of future depression⁸. Studies have also considered whether ‘sleep health’ characteristics (e.g., Satisfaction, Alertness/Sleepiness, Timing, Efficiency, and Duration; abbreviated ‘SATED’), either individually or in a composite score, predict depressive symptoms in older adults. For example, in older women, extreme daytime sleepiness was the strongest individual sleep health characteristic and doubled the risk for depression, but reporting ≥ 3 adverse sleep health characteristics tripled the risk for future depression⁹.

Another challenge in the study of sleep and depression—and health research more broadly—is that extant findings are based on studies with different populations, obscuring our ability to evaluate which findings are generalizable versus population-specific. Rates and predictors of incident depression may differ by demographic and health factors¹⁰; however, potential confounders are not consistently measured or incorporated in analyses performed across different studies. Additional sources of heterogeneity include different analytic approaches and small sample sizes. Analyses that aggregate data at the person level, rather than the study level, address many of these limitations: they allow for covariation of important individual differences, standardize analytic procedures, and boost power for moderation analysis needed to examine differential associations¹¹.

In the current study, we analyzed data from a harmonized multi-cohort United States sample (‘US Sample’) and, separately, from the Rotterdam Study from The Netherlands (‘RS Sample’). Participants did not have clinically relevant depressive symptoms at an initial visit and were reassessed for depressive symptoms 3–6 years later. Leveraging these samples, our specific aims were to: (1) determine which individual self-reported sleep measures have the strongest associations with future clinically relevant depressive symptoms; (2) develop and validate composite sleep scores that are associated with future clinically relevant depressive symptoms; and (3) explore whether findings differ across subgroups stratified by sex, race, age and cohort.

Methods

Cohorts and analytic samples

We harmonized and aggregated four cohorts of older adults from the US: the Study of Osteoporotic Fractures¹² (SOF), the Osteoporotic Fractures in Men Study¹³ (MrOS), the Memory and Aging Study¹⁴ (MAP), and the Minority Aging Research Study¹⁵ (MARS). We also developed a Rotterdam Study¹⁶ (RS) sample, harmonized to have identical variable names and coding as the US sample. RS and US samples were kept separate because current European data sharing laws generally prohibit RS from being directly analyzed within US-based institutions. Our harmonization procedures across these cohorts were reported previously¹⁷. Briefly, content experts were given list of all items and then used an iterative process to group them into conceptual domains and subdomains. Within (sub)domains, we selected up to one item from each cohort and then recoded the items to be comparable across cohorts. Finally, experts not involved in the harmonization process rated the perceived harmonizability of the sleep items.

To select the initial visit and accompanying follow-up visit for MrOS, SOF, and RS, we balanced several factors including the follow-up interval length, cohort age, data availability, and harmonizability. These considerations resulted in initial visits and follow-up visits that were 3–6 years apart. Because MAP and MARS have annual assessments, for each participant we selected the initial visit as the first observation year with complete sleep data, and the follow-up visit as the observation year that was closest to, but not beyond, 6 years from the selected initial visit.

Inclusion criteria for the US and RS analytic samples were: (1) age ≥ 60; (2) no known or suspected dementia at the initial visit; (3) complete data on self-report sleep characteristics, depressive symptoms, and antidepressant use at the initial visit (i.e., no missing data related to sleep or the outcomes of interest); (4) complete data for depressive symptoms at the follow-up visit; and (5) no clinically relevant depressive symptoms at the initial visit. All participants provided informed consent and Institutional Review Boards at respective institutions approved each study.

Further cohort information and details of how inclusion/exclusion criteria were defined in each cohort are provided in Supplementary Tables 1–2. A flow chart depicting sample derivations is provided in Supplementary Fig. 1.

Measures

Outcome

‘Clinically relevant depressive symptoms’ was selected as our primary outcome because it is interpretable, clinically relevant and harmonizable. Three depression scales were used across the cohorts: the 10-item Center for Epidemiologic Studies Depression Scale¹⁸ (CESD-10; MAP and MARS), the 20-item CESD¹⁹ (RS), and the 15-item Geriatric Depression Scale (GDS)²⁰ (MrOS and SOF). Based on prior studies’ determinations of optimal cutoffs for the diagnosis of major depressive disorder, the presence of clinically relevant depressive symptoms at follow-up was indicated by a GDS score ≥ 6²⁰, a CESD-10 score ≥ 4²¹ or a 20-item CESD ≥ 16¹⁹.

Self-reported sleep

We selected nine harmonized self-reported sleep health and sleep disorder symptom characteristics that were potentially relevant for future depressive symptoms: ‘sleep quality’, ‘daytime symptoms’, ‘midpoint timing’, ‘sleep efficiency’, ‘sleep duration’, ‘difficulty falling asleep’, ‘difficulty staying asleep’, ‘frequency of snoring’ and ‘frequency of stopping breathing during sleep’. All self-reported sleep characteristics were rigorously harmonized across cohorts and previously judged to have high harmonizability, except ‘daytime symptoms’, which had moderate harmonizability¹⁷. The ‘daytime symptoms’ sleep characteristic was more heterogeneously measured, with cohort items inquiring about tiredness and fatigue, difficulty staying awake, and sleep problems hindering daytime activities. Because different cohorts administered these different items, we did not have information on the degree to which participants viewed these various daytime symptoms as being similar; thus, we take a conservative approach and refer to them only as daytime symptoms. For interpretability and comparability, we categorized each sleep characteristic based on potentially adverse levels for older adults. Whenever possible, definitions were based on pre-existing and published cutoffs²². Otherwise, we used the frequency distributions and clinical content to indicate potentially adverse levels of sleep characteristics. See Table 1 for definitions of harmonized sleep measures and their cut-offs and Supplementary Table 3 for the original cohort item wording of each item.

Table 1 Individual and composite sleep features.

Full size table

We considered five self-reported sleep composite scores derived from the combinations of the nine categorical sleep characteristics: ‘All-Unweighted’, ‘All-Weighted’, ‘SATED’, ‘Selected’, and ‘Insomnia with Short Sleep’. The All-Unweighted and All-Weighted scores incorporate the full set of sleep features, with the former a simple sum of sleep indicators and the latter a sum incorporating weights derived in the other (external) sample (i.e. weights for US score derived in RS, and vice versa). The Selected score was derived to offer a composite score requiring only the sleep items with the strongest associations with depression, selected based model results from on the other sample. SATED and Insomnia with Short Sleep are composite sleep indices frequently considered in sleep literature^7,9. Composite score definitions and derivations are provided in Table 1, with technical details in Supplementary Text 1 and Supplementary Table 5.

Covariates

Education, marital status, cohort (US only), sex, race (US only), age, and follow-up time were considered potential confounders and included in all models. These are denoted as base covariates. Smoking status, alcohol use, body mass index, number of physical health comorbidities (considering stroke, thyroid disease, heart attack/congestive heart failure, hypertension, diabetes), use of sedating medications, and use of non-sedating antidepressant medications were hypothesized to be either confounders or theoretical mediators¹⁰. As the design of our study does not allow for formal differentiation of confounders versus mediators, we refrain from distinguishing their role and instead refer to them as secondary covariates.

While a variety of medications can have subtle effects on sleep, we focused on those with well-described and consistent effects on sleep and/or mood when defining specific medications to be included in the ‘sedating’ and ‘non-sedating antidepressant’ categories. Sedating medications included any tricyclic antidepressant, mirtazapine, nefazodone, non-benzodiazepine non-barbituate sedative hypnotic medications, or trazodone, coded from individual lists of drugs. Non-sedating antidepressant medications included any antidepressant except tricyclic antidepressants, mirtazapine, or nefazodone. In the US cohort, medications were collected via visual assessment of medication containers brought to the visit. In the RS cohort, medication use was determined based on pharmacy dispensary records.

Four covariates in the US sample each had up to < 0.5% missing data. Covariate missingness in RS was < 0.5%, except for education (1.37%) and BMI (2.5%). Missing covariate values were imputed using the MissForest package in R. Further details of categorization and measurement of all covariates are provided in Table 2 and Supplementary Table 4.

Table 2 Sample characteristics for the harmonized U.S. sample and the RS sample.

Full size table

Statistical analyses

Variables in the US and RS cohorts were harmonized so that identical code could be directly applied to each dataset, except for RS code excluding indicators of race and cohort. For all analyses, code was originally developed at the University of Pittsburgh for the US cohort and then sent to Erasmus MC University Medical Center to be run independently on the RS cohort. In preliminary analyses, we used descriptive statistics to assess sleep and covariate distributions in full and stratified samples and explored Spearman correlations among sleep indicators. Across all analyses, our focus was on interpreting effect sizes and 95% confidence intervals for inference. When examining sleep and depression associations, we considered a Risk Ratio (RR) of 1.86 as a guiderail to indicate a potentially moderate effect size²³. To reduce family-wise error, we used Benjamini Hochberg multiple comparison corrections across tests within each of our three aims to underscore the most robust findings. R Studio 2023.09.1 was used for all analyses.

Aims 1 and 2: individual sleep characteristics and composite scores

We used generalized linear models with a log link (i.e., Poisson regression) and robust standard errors to regress clinically relevant depressive symptoms at follow-up on each individual sleep feature (Aim 1) and each composite sleep score (Aim 2), adjusting for base covariates. This approach produces RR estimates that are interpretable and robust for low-incidence outcomes. A separate model was fit for each sleep characteristic or score.

For each individual sleep item and composite score, we used contrasts to estimate RRs across the observed range, with the reference group set at zero (optimal sleep health based on that characteristic or score). Given our interest in health screening and reducing false negatives, we examined the sensitivity associated with each model in Aims 1 and 2 plus a model including only base covariates. However, for completeness, we also secondarily report specificity and accuracy. When computing these performance metrics, we indicated a person was ‘positive’ for future clinically relevant depressive symptoms if their predicted probability was > 0.07, the average rate of future clinically relevant depressive symptoms across US and RS samples.

We performed three sets of sensitivity analyses for Aims 1 and 2: (1) allowing for new use of any antidepressant medication to count as incident depressive symptoms, and thus excluding participants using sedating or non-sedating antidepressants at the initial visit; (2) removing the sleep item from the CESD-10 and CESD-20 (the GDS does not include a sleep item); and (3) adding secondary covariates to the models to assess whether effect sizes remained consistent.

Exploratory aim 3: moderation

Using Poisson regression, we explored whether sex, age, race (US only), and cohort (US only) moderated the association of individual and composite sleep measures with future clinically relevant depressive symptoms. These analyses are exploratory because we generally did not have specific hypotheses about which demographic profiles would have better or worse sleep health for each sleep score or sleep characteristic. We also ran models within samples stratified by age, sex, race (US only), and cohort (US only), adjusting for base covariates.

Results

Preliminary descriptive analyses

The US sample is older than the RS sample by about 10 years. In the US sample, 15% of people identify as Black or African American, while RS primarily includes people of Dutch ancestry without self-reported race. Relative to RS, US participants have a shorter median follow-up time and higher prevalences of daytime symptoms, low sleep efficiency, difficulty staying asleep, and stopping breathing during sleep. In contrast, RS participants have greater prevalences of late timing and snoring. We observed low-to-moderate correlations among individual sleep characteristics (median [Q1, Q3] magnitude = 0.07 [0.03, 0.48] in RS; 0.05 [0.02, 0.12] in US). Tables 2 and 3 show full clinical, sociodemographic and sleep characteristics for the US and RS samples. Supplementary Tables 6–11 include demographic and sleep characteristics stratified by age, sex, race (US only) and cohort (US only).

Table 3 Self-report sleep measures in US sample and stratified samples.

Full size table

Aims 1 and 2: individual sleep characteristics and composite scores

Figure 1 provides RRs (95% CIs) for all individual sleep characteristics in the RS and US samples. In both the US and RS samples, difficulty falling asleep and daytime symptoms had moderate associations with future clinically relevant depressive symptoms. Poor quality, short duration, and difficulty staying asleep were also associated with future clinically relevant depressive symptoms in both samples, but with smaller effect sizes. In the US sample only, stopping breathing during sleep and late midpoint increased risk for future clinically relevant depressive symptoms.

All five composite sleep scores were associated with future clinically relevant depressive symptoms after adjusting for multiple comparisons; however, only four of these scores (Selected, SATED, All-Weighted, and All-Unweighted) had observed levels that tripled risk in either the US or RS sample (Fig. 2). Compared to participants endorsing zero items, endorsing two out of two (2/2) items in the Selected score was associated with a RR (95% CI) of 3.32 (2.33, 4.73) in the US sample and 3.19 (2.11, 4.82) in the RS sample. Endorsing 5/5 items on the SATED score was associated with a RR (95% CI) of 3.33 (2.02, 5.49) in the US sample and 2.65 (1.61, 4.38) in the RS sample. Endorsing 7/9 items on the All-Unweighted score was associated with a RR (95% CI) of 4.01 (2.50, 6.43) in the US sample and 3.16 (1.97, 5.06) in the RS sample. A value of 7/9 on the All-Weighted score was associated with a RR (95% CI) of 3.35 (2.26, 4.98) in the US sample and 3.10 (2.05, 4.69) in the RS sample. The percentages of participants endorsing these levels in the US and RS samples are provided in Table 3. The Selected score had the highest sensitivity for predicting future clinically relevant depressive symptoms (0.724 and 0.590 in US and RS, respectively), followed closely by difficulty falling asleep (0.712 and 0.586). Adding either the Selected score or difficulty falling asleep to the base model increased sensitivity by 11–12%. All composite score contrasts and performance results are provided in Supplementary Tables 12–13.

Across all three planned sets of sensitivity analyses, difficulty falling asleep and daytime symptoms remained strong individual sleep characteristics. Poor quality and short sleep duration were also repeatedly identified as predictive individual sleep characteristics. When considering composite scores, the Selected score had the strongest effect size while requiring the fewest items, as endorsing both items consistently produced RRs > 3.0, while other scores required endorsing at least five items to reach the same effect size. Full sensitivity analysis results are provided in Supplementary Figs. 2–7.

Exploratory aim 3: moderation

The only formal interaction passing multiple comparison corrections was between sex and stopping breathing in the US sample (RR [95% CI] 4.23 [1.95, 9.16] for sex-by-stopping breathing interaction). Accordingly, in stratified US analyses we observed a large effect of stopping breathing during sleep in women (RR [95%CI] 4.77 [2.73, 8.35]) but no association in men (1.23 [0.73, 2.10]). The All-Unweighted composite score (which includes stopping breathing) was slightly stronger among women than men in the US sample, although this interaction was not statistically significant (RR [95% CI] 1.25 [1.14,1.36] in women; 1.19 [1.08, 1.30] in men). In RS, a small effect size for stopping breathing during sleep and future clinically relevant depression symptoms was observed in women (1.52 [0.87, 2.64]) but not in men (0.94 [0.55, 1.61]), although this difference was not statistically significant. See Fig. 3 for RRs (95% CIs) of individual sleep characteristics by sex, and Supplementary Fig. 12 for RRs (95% CIs) of composite sleep scores by sex.

Although we did not observe any statistically significant sleep-by-race interactions, four of the five SATED sleep health components (daytime symptoms, poor quality, short duration, and late midpoint) were associated with future clinically relevant depressive symptoms among White US participants but not among Black US participants. Accordingly, the SATED score was related to future depressive symptoms among White participants (RR [95% CI] 1.33 [1.19, 1.47]) but not among black participants (RR [95% CI] 1.06 [0.85, 1.33]). Conversely, stopping breathing was strongly associated with future clinically relevant depressive symptoms among Black participants (RR [95% CI] 4.19 [1.84, 9.54]) but not among White participants (RR [95% CI] 1.50 [0.93, 2.43]). Plots illustrating results for all stratified analyses are provided in Supplementary Figs. 8–14.

Discussion

In large samples of non-depressed older adults from the United States and the Netherlands, we investigated nine self-reported sleep features, along with five specific combinations of these features, to determine which have the strongest associations with future clinically relevant depressive symptoms 3–6 years later. Difficulty falling asleep and sleep-related daytime symptoms had the largest effect sizes across samples. Older adults endorsing either of these features had approximately double the risk of future depressive symptoms, while those endorsing both features had approximately triple the risk.

Difficulty falling asleep and sleep-related daytime symptoms have each previously emerged as predictors of depression⁶. Their combination overlaps with the clinical diagnosis of insomnia, which is indicated by distress related to falling asleep, staying asleep, or early morning awakenings, with resulting daytime impairments. These daytime impairments can manifest heterogeneously and include a wide range of symptoms including fatigue, impaired performance, mood disturbance, reduced motivation, and sleepiness²⁴. Although the two features we identified are components of the insomnia diagnosis, other common insomnia symptoms including sleep efficiency and difficulty staying asleep did not consistently emerge as predictors of future depressive symptoms. The specific combination of difficulty falling asleep and daytime symptoms may signify an insomnia subtype that is experiencing a vicious circle of problems. Difficulty falling asleep at night may result in daytime sleepiness or fatigue, which in turn can reinforce difficulty falling asleep the subsequent night. Circular symptoms such as these may represent a form of conditioned arousal at bedtime that, in combination with sleep difficulty, could lead to affective dysregulation and depression²⁵. This insomnia subtype should be examined in future studies to assess its potential predictive value for other health outcomes and other populations. Interventions such as Trans-C²⁶, which considers a wider spectrum of sleep health problems beyond standard insomnia symptoms, may be beneficial for stopping this vicious sleep cycle and potentially preventing future depression, although further research is warranted.

Our large and heterogenous sample allowed us to examine which findings were most similar or different across demographic strata. Difficulty falling asleep was among the most robust features identified, as it was associated with clinically relevant depressive symptoms across nearly all stratified samples, as well as all three sensitivity analyses. Conversely, stopping breathing during sleep showed much stronger associations with future depressive symptoms among women and Black participants than among men and White participants. Although the gold standard for diagnosis of sleep apnea is polysomnography, stopping breathing during sleep is a frequent clinical complaint related to sleep apnea. Previous studies have suggested that women with sleep apnea are at higher risk of future depression²⁷, but few studies have directly examined the role of race in the relationship between sleep apnea and depression in older adults. However, among Black older adults, sleep apnea enhances vulnerability to other adverse health outcomes, including stroke^28,29. As we do not consistently have sleep apnea characterized with polysomnography across cohorts, we could not examine this relationship in more detail; however, such analyses will be an important future direction.

Our investigation of composite sleep scores suggested that the combination of daytime symptoms and difficulty falling asleep may be an efficient and scalable way to identify people at risk for depression, especially in the US sample. Adding seven additional sleep features (for a total of nine features in the All-Unweighted score) provided some added benefit in effect sizes for those with very high scores, although in our sample ≤ 1% endorsed these higher levels. In a sample with poorer sleep health, it is possible that there may be a greater benefit to adding these additional questions. It is noteworthy that the weighted version of the overall score did not perform better than a simple sum of the items. However, studies with other outcomes, features and samples may reach different conclusions regarding the value of weighted versus unweighted composite scores. It is also likely that using machine learning models accounting for complex interactions among features could further enhance effect sizes, as we have shown previously using random forests³⁰.

Major strengths of our study include the large sample sizes, analyses incorporating multiple demographically distinct cohorts, self-reported items that can be incorporated into clinical care, comprehensive variable harmonization, rigorous external validation to enhance generalizability outside the specific data considered here, and careful adjustment for potential confounders. Limitations are related to the observational nature of the study and some differences in the wording of the ‘daytime symptoms’ item across US cohorts and RS study, with cohort items differentially emphasizing energy, enthusiasm, or fatigue. This heterogeneity in ‘daytime symptoms’ items presents a challenge to the field of sleep more broadly, extending beyond our own work¹⁷. Along these same lines, different cohorts used different depression scales, limiting our ability to create a rigorous and harmonized continuous depression measure. Instead, we used a dichotomous depression measure, which is less sensitive than a continuous measure and which precluded the ability to adjust for subthreshold levels of depression at the initial sleep visit. Despite these cohort differences, findings were relatively consistent across cohorts. Finally, although our sample does not include people with clinically relevant depressive symptoms, we cannot rule out the possibility that the sleep characteristics we identified represent a prodromal or subthreshold form of depression.

Our work lays the foundation for several future directions. Investigating the mechanisms linking difficulty falling asleep and daytime symptoms with future depression is an important next step. Plausible pathophysiological pathways include inflammatory cytokines, neural circuit dysfunction, and neurochemical pathways related to monoamines, orexins and other neurotransmitters³. Further characterizing older adults based on actigraphy and polysomnography may add depth to our findings, as these objective measures inherently quantify different aspects of sleep than subjective self-report. However, self-reported sleep in particular may provide useful insight into depression risk precisely because its subjective nature reflects subtle aspects of sleep that cannot be detected with physiological and behavioral measures³¹. For instance, self-report can reflect affective and cognitive arousal in a way that wrist movements do not. Finally, future studies must assess whether early identification (e.g., using the combination difficulty falling asleep and daytime symptoms), followed by sleep treatment to address these challenges, can reduce the risk of future depression onset in older adults.

Data availability

Original cohort data for MAP, MARS, MrOS, and SOF are available for request through the Rush Alzheimer’s Disease Center (radc.rush.edu), MrOS Online (www.mrosonline.ucsf.edu), SOF online (www.sofonline.ucsf.edu), and through the National Sleep Research Resource (www.sleepdata.org). Original Rotterdam Study data can be obtained upon request; however, because of restrictions based on privacy regulations and informed consent of the participants, data cannot be made freely available in a public repository. The statistical code is posted on GitHub at https://github.com/oryshkewychns/SR_Sleep_Depression.

References

McCall, W. V. & Kintziger, K. W. Late life depression: A global problem with few resources. Psychiatr. Clin. North Am. 36(4), 475–481. https://doi.org/10.1016/j.psc.2013.07.001 (2013).
Article Google Scholar
Buysse, D. J. et al. Clinical diagnoses in 216 insomnia patients using the International classification of sleep disorders (ICSD), DSM-IV and ICD-10 categories: A report from the APA/NIMH DSM-IV field trial. Sleep 17(7), 630–637. https://doi.org/10.1093/sleep/17.7.630 (1994).
Article CAS Google Scholar
Fang, H., Tu, S., Sheng, J. & Shao, A. Depression in sleep disturbance: A review on a bidirectional relationship, mechanisms and treatment. J. Cell Mol. Med. 23(4), 2324–2332. https://doi.org/10.1111/jcmm.14170 (2019).
Article PubMed Central Google Scholar
Campbell, R. L., Feldner, M. T. & Leen-Feldner, E. W. An experimental test of the effects of acute sleep deprivation on affect and avoidance. J. Behav. Ther. Exp. Psychiatry. 77, 101770. https://doi.org/10.1016/j.jbtep.2022.101770 (2022).
Article Google Scholar
Gebara, M. A. et al. Effect of insomnia treatments on depression: A systematic review and meta-analysis. Depress. Anxiety 35(8), 717–731. https://doi.org/10.1002/da.22776 (2018).
Article Google Scholar
Szklo-Coxe, M., Young, T., Peppard, P. E., Finn, L. A. & Benca, R. M. Prospective associations of insomnia markers and symptoms with depression. Am. J. Epidemiol. 171(6), 709–720. https://doi.org/10.1093/aje/kwp454 (2010).
Article PubMed Central Google Scholar
Fernandez-Mendoza, J. et al. Insomnia and incident depression: Role of objective sleep duration and natural history. J. Sleep Res. 24(4), 390–398. https://doi.org/10.1111/jsr.12285 (2015).
Article PubMed Central Google Scholar
Wickwire, E. M. & Albrecht, J. S. Occult, undiagnosed obstructive sleep apnea increases risk of depression among a national sample of older adult Medicare beneficiaries. Sleep Health https://doi.org/10.1016/j.sleh.2023.05.008 (2023).
Article Google Scholar
Furihata, R. et al. An aggregate measure of sleep health is associated with prevalent and incident clinically significant depression symptoms among community-dwelling older women. Sleep https://doi.org/10.1093/sleep/zsw075 (2017).
Article Google Scholar
Maier, A., Riedel-Heller, S. G., Pabst, A. & Luppa, M. Risk factors and protective factors of depression in older people 65+. A systematic review. PLoS ONE 16(5), e0251326. https://doi.org/10.1371/journal.pone.0251326 (2021).
Article CAS PubMed Central Google Scholar
Riley, R. D., Lambert, P. C. & Abo-Zaid, G. Meta-analysis of individual participant data: rationale, conduct, and reporting. BMJ (Clin. Res. Ed) 340, c221. https://doi.org/10.1136/bmj.c221 (2010).
Article Google Scholar
Cummings, S. R. et al. Bone density at various sites for prediction of hip fractures. The study of osteoporotic fractures research group. Lancet 341(8837), 72–75 (1993).
Article CAS Google Scholar
Orwoll, E. et al. Design and baseline characteristics of the osteoporotic fractures in men (MrOS) study: A large observational study of the determinants of fracture in older men. Contemp. Clin. Trials 26(5), 569–585. https://doi.org/10.1016/j.cct.2005.05.006 (2005).
Article Google Scholar
Bennett, D. A. et al. Overview and findings from the rush memory and aging project. Curr. Alzheimer Res. 9(6), 646–663 (2012).
Article CAS PubMed Central Google Scholar
Barnes, L. L., Shah, R. C., Aggarwal, N. T., Bennett, D. A. & Schneider, J. A. The minority aging research study: Ongoing efforts to obtain brain donation in African Americans without dementia. Curr. Alzheimer Res. 9(6), 734–745 (2012).
Article CAS PubMed Central Google Scholar
Ikram, M. A. et al. The Rotterdam study: 2018 update on objectives, design and main results. Eur. J. Epidemiol. 32(9), 807–850. https://doi.org/10.1007/s10654-017-0321-4 (2017).
Article PubMed Central Google Scholar
Wallace, M. L. et al. Pioneering a multi-phase framework to harmonize self-reported sleep data across cohorts. Sleep 47(9), zsae115. https://doi.org/10.1093/sleep/zsae115 (2024).
Article PubMed Central Google Scholar
Kohout, F. J., Berkman, L. F., Evans, D. A. & Cornoni-Huntley, J. Two shorter forms of the CES-D (center for epidemiological studies depression) depression symptoms index. J. Aging Health 5(2), 179–193. https://doi.org/10.1177/089826439300500202 (1993).
Article CAS Google Scholar
Radloff, L. S. The CES-D scale: A self-report depression scale for research in the general population. Appl. Psychol. Meas. 1, 385–401 (1977).
Article Google Scholar
Almeida, O. P. & Almeida, S. A. Short versions of the geriatric depression scale: a study of their validity for the diagnosis of a major depressive episode according to ICD-10 and DSM-IV. Int. J. Geriatr. Psychiatry 14(10), 858–865 (1999).
Article CAS Google Scholar
Irwin, M., Artin, K. H. & Oxman, M. N. Screening for depression in the older adult: Criterion validity of the 10-item center for epidemiological studies depression scale (CES-D). Arch. Int. Med. 159(15), 1701–1704. https://doi.org/10.1001/archinte.159.15.1701 (1999).
Article CAS Google Scholar
Wallace, M. L. et al. Heightened sleep propensity: A novel and high-risk sleep health phenotype in older adults. Sleep Health 5(6), 630–638. https://doi.org/10.1016/j.sleh.2019.08.001 (2019).
Article CAS PubMed Central Google Scholar
George, A., Stead, T. S. & Ganti, L. What’s the risk: Differentiating risk ratios, odds ratios, and hazard ratios?. Cureus 12(8), e10047. https://doi.org/10.7759/cureus.10047 (2020).
Article PubMed Central Google Scholar
American Psychiatric Association, DSM-5 Task Force. Diagnostic and Statistical Manual of Mental Disorders 5th edn. (American Psychiatric Association, 2013).
Book Google Scholar
Ten Brink, M. et al. Sleep and affect: A conceptual review. Sleep Med Rev. 65, 101670. https://doi.org/10.1016/j.smrv.2022.101670 (2022).
Article PubMed Central Google Scholar
Harvey, A. G. & Buysse, D. J. Treating Sleep Problems: A Transdiagnostic Approach 1st edn. (The Guilford Press, 2017).
Google Scholar
Farajzadeh, M. et al. The association between obstructive sleep apnea and depression in older adults. Nurs. Midwifery Stud. 5(2), e32585. https://doi.org/10.17795/nmsjournal32585 (2016).
Article PubMed Central Google Scholar
Ramos, A. R., Seixas, A. & Dib, S. I. Obstructive sleep apnea and stroke: Links to health disparities. Sleep Health 1(4), 244–248. https://doi.org/10.1016/j.sleh.2015.09.005 (2015).
Article PubMed Central Google Scholar
Grandner, M. A. Sleep, health, and society. Sleep Med. Clin. 12(1), 1–22. https://doi.org/10.1016/j.jsmc.2016.10.012 (2017).
Article MathSciNet Google Scholar
Wallace, M. L. et al. Use and misuse of random forest variable importance metrics in medicine: Demonstrations through incident stroke prediction. BMC Med. Res. Methodol. 23(1), 144. https://doi.org/10.1186/s12874-023-01965-x (2023).
Article PubMed Central Google Scholar
Paudel, M. et al. Sleep disturbances and risk of depression in older men. Sleep 36(7), 1033–1040. https://doi.org/10.5665/sleep.2804 (2013).
Article PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by NIH Grant RF1AG056331 (PI: Wallace) and by the Convergence, the alliance between Erasmus Medical Centre Rotterdam, Erasmus University Rotterdam and Delft University of Technology, as part of the A Lifecourse and Individual-based View on Lifestyle to Enhance Health (ALIVE) project (AL). The Osteoporotic Fractures in Men (MrOS) Study is supported by National Institutes of Health funding. The following institutes provide support: the National Institute on Aging (NIA), the National Institute of Arthritis and Musculoskeletal and Skin Diseases (NIAMS), the National Center for Advancing Translational Sciences (NCATS), and NIH Roadmap for Medical Research under the following Grant Numbers: U01 AG027810, U01 AG042124, U01 AG042139, U01 AG042140, U01 AG042143, U01 AG042145, U01 AG042168, U01 AR066160, R01 AG066671, and UL1 TR002369. The National Heart, Lung, and Blood Institute (NHLBI) provides funding for the MrOS Sleep ancillary study “Outcomes of Sleep Disorders in Older Men” under the following Grant Numbers: R01 HL071194, R01 HL070848, R01 HL070847, R01 HL070842, R01 HL070841, R01 HL070837, R01 HL070838, and R01 HL070839. The Study of Osteoporotic Fractures (SOF) is supported by National Institutes of Health funding. The National Institute on Aging (NIA) provides support under the following Grant Numbers: R01 AG005407, R01 AR35582, R01 AR35583, R01 AR35584, R01 AG005394, R01 AG027574, R01 AG027576, and R01 AG026720. Rush Memory and Aging Project (MAP) and Minority Aging Research Study (MARS) are funded under the following Grant Numbers: R01AG22018 (MARS), R01AG17917 (MAP). The Rotterdam Study is funded by Erasmus Medical Center and Erasmus University, Rotterdam, Netherlands Organization for the Health Research and Development (ZonMw), the Research Institute for Diseases in the Elderly (RIDE), the Ministry of Education, Culture and Science, the Ministry for Health, Welfare and Sports, the European Commission (DG XII), and the Municipality of Rotterdam. This research is part of the A Lifecourse and Individual-based View on Lifestyle to Enhance Health (ALIVE) flagship, funded by the Convergence, the alliance between Erasmus MC University Medical Center Rotterdam, Erasmus University Rotterdam and Delft University of Technology.

Author information

Authors and Affiliations

Department of Psychiatry, University of Pittsburgh, 3811 O’Hara Street, Pittsburgh, PA, 15213, USA
Meredith L. Wallace, Daniel J. Buysse & Meryl A. Butters
Department of Statistics, University of Pittsburgh, Pittsburgh, PA, USA
Meredith L. Wallace & Lucas Mentch
University of Pittsburgh Medical Center, Pittsburgh, PA, USA
Nina Oryshkewych
Department of Epidemiology, Erasmus MC University Medical Centre, Rotterdam, The Netherlands
Sanne J.W. Hoepel
California Pacific Medical Center, San Francisco, CA, USA
Katie L. Stone
Departments of Psychiatry and Behavioral Sciences, Neurology and Epidemiology, University of California at San Francisco, San Francisco, CA, USA
Kristine Yaffe
Rush Alzheimer’s Disease Center, Rush University Medical Center, Chicago, IL, USA
Lisa L. Barnes & Annemarie Luik
Department of Neurology, University of Toronto, Toronto, ON, Canada
Andrew S. Lim
Division of Epidemiology and Community Health and Department of Medicine, University of Minnesota, Minneapolis, MN, USA
Kristine E. Ensrud
Division of Rheumatology, Inflammation and Immunity, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Misti L. Paudel
Trimbos Institute - The Netherlands Institute of Mental Health and Addiction, Utrecht, The Netherlands
Annemarie Luik
Department of Epidemiology and Biostatistics, University of California, San Francisco, USA
Katie L. Stone

Authors

Meredith L. Wallace
View author publications
Search author on:PubMed Google Scholar
Nina Oryshkewych
View author publications
Search author on:PubMed Google Scholar
Sanne J.W. Hoepel
View author publications
Search author on:PubMed Google Scholar
Daniel J. Buysse
View author publications
Search author on:PubMed Google Scholar
Lucas Mentch
View author publications
Search author on:PubMed Google Scholar
Meryl A. Butters
View author publications
Search author on:PubMed Google Scholar
Katie L. Stone
View author publications
Search author on:PubMed Google Scholar
Kristine Yaffe
View author publications
Search author on:PubMed Google Scholar
Lisa L. Barnes
View author publications
Search author on:PubMed Google Scholar
Andrew S. Lim
View author publications
Search author on:PubMed Google Scholar
Kristine E. Ensrud
View author publications
Search author on:PubMed Google Scholar
Misti L. Paudel
View author publications
Search author on:PubMed Google Scholar
Annemarie Luik
View author publications
Search author on:PubMed Google Scholar

Contributions

Dr. Wallace oversaw the data harmonization, designed the study, interpreted the data, and wrote and edited the manuscript. Ms. Oryshkewych and Ms. Hoepel performed data harmonization and statistical analyses and assisted in data interpretation and editing the manuscript. Dr. Buysse assisted in designing the study, interpreting the data, and editing the manuscript. Drs. Barnes, Lim, Luik, Yaffe, and Stone advised on the interpretation and use of data from each cohort, interpreted the data, and assisted in editing the manuscript. Drs. Ensrud, Paudel Paudel, Mentch, Butters, and Yaffe advised on interpreting the data and assisted editing the manuscript. All authors approved the submitted version and have agreed to be accountable for their contributions.

Corresponding author

Correspondence to Meredith L. Wallace.

Ethics declarations

Competing interests

Dr. Buysse has served as a paid or unpaid consultant to Sleep Number. He is an author of the Pittsburgh Sleep Quality Index, Pittsburgh Sleep Quality Index Addendum for PTSD (PSQI-A), Brief Pittsburgh Sleep Quality Index (B-PSQI), Daytime Insomnia Symptoms Scale, Pittsburgh Sleep Diary, Insomnia Symptom Questionnaire, and RU_SATED (copyright held by University of Pittsburgh). These instruments have been licensed to commercial entities for fees. He is also co-author of the Consensus Sleep Diary (copyright held by Ryerson University), which is licensed to commercial entities for a fee. Dr. Wallace is a statistical consultant for Health Rhythms, Sleep Number, and Noctem Health, unrelated to the current work. All other co-authors have no disclosures to report.

Ethics approval

The SHARE Initiative (RF1AG056331) received an exempt Institutional Review Board (IRB) declaration for data harmonization, aggregation, and analysis of secondary data from each cohort included in the present study (STUDY21010174). The study procedures for each of the original US cohorts were approved by the Institutional Review Boards (IRBs) of their respective institutions (Sutter Health IRB # 23.104EXP; WCG IRB # 20200148; Rush University Medical Center IRB # L99032481-CR13). The Rotterdam Study procedures were approved by the Medical Ethics committee of the Erasmus MC (registration number MEC 02.1015) and by the Dutch Ministry of Health, Welfare and Sport (Population Screening Act WBO, license number 1071272-159521-PG) and has been entered into the Netherlands National Trial Register (NTR; www.trialregister.nl) and the WHO International Clinical Trials Registry Platform (ICTRP; www.who.int/ictrp/network/primary/en/) under shared catalog number NTR6831. All methods and procedures were carried out in accordance with relevant guidelines and regulations.

Informed consent

Participants in each cohort provided written informed consent to participate in the respective longitudinal studies.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Wallace, M.L., Oryshkewych, N., Hoepel, S.J. et al. An international multi-cohort investigation of self-reported sleep and future depressive symptoms in older adults. Sci Rep 15, 23918 (2025). https://doi.org/10.1038/s41598-025-07864-z

Download citation

Received: 03 October 2024
Accepted: 16 June 2025
Published: 04 July 2025
DOI: https://doi.org/10.1038/s41598-025-07864-z

Subjects

Abstract

Similar content being viewed by others

Sleep habits are associated with cognition decline in physically robust, but not in frail participants: a longitudinal observational study

Sleep characteristics across the lifespan in 1.1 million people from the Netherlands, United Kingdom and United States: a systematic review and meta-analysis

A multi-institutional cohort study on risk of sleep disorders in dry eyes patients using TriNetX

Introduction

Methods

Cohorts and analytic samples

Measures

Outcome

Self-reported sleep

Covariates

Statistical analyses

Aims 1 and 2: individual sleep characteristics and composite scores

Exploratory aim 3: moderation

Results

Preliminary descriptive analyses

Aims 1 and 2: individual sleep characteristics and composite scores

Exploratory aim 3: moderation

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethics approval

Informed consent

Additional information

Publisher’s note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links