Introduction

Iron is an essential mineral that facilitates many biological functions throughout the human body. For instance, it is key for essential functions at the systemic and cellular levels, such as oxygen transfer and storage, production and function of neurotransmitters, redox balance, inflammation regulation and cellular growth1,2.

Iron is necessary for the formation of hemoglobin, a vital oxygen-transporting protein present in the red blood cells. The delivery of oxygen from the lungs to tissues is important for the generation of adenosine triphosphate (ATP), the currency of cellular energy. Iron is also necessary for numerous enzymes involved in energy metabolism. Altogether, these functions highlight iron’s critical role in sustaining energy-intensive cellular processes, including those related to cardiovascular or neurological health3.

The ability of cells to produce energy, especially cells of the brain or heart with high energy demands, could be compromised without an adequate supply of iron (Supplementary Fig. 1). Such cells include neurons, responsible for maintaining the brain’s energy-demanding cognitive functions, and cardiomyocytes, which require substantial energy to support essential cardiac functions. In contrast, excessive iron levels can trigger the generation of free radicals, leading to oxidative stress, cellular damage, and inflammatory responses. Furthermore, these free radicals can induce the accumulation of iron in specific organs, which can result in tissue damage and impaired organ function4.

Given the importance of iron for sustaining heart and brain health, it is unsurprising that disturbances in iron metabolism have been implicated in the pathogenesis of a range of chronic diseases, including cardiovascular (CVD) and neurodegenerative diseases (NDD). CVD and NDD impose a significant burden on global health, contributing to considerable morbidity, mortality, and socio-economic implications. Understanding the underlying mechanisms driving the development and progression of these diseases is therefore essential for developing effective preventive and therapeutic strategies to improve health.

The association between iron levels (or body iron stores, measured by ferritin levels) and several CVD and NDD is reported with inconsistent or contradictory results by numerous observational and prospective studies5,6,7,8,9,10. For example, with regard to iron and CVD, a systematic review meta-analysis of prospective studies found that higher serum iron status may confer a protective effect against coronary heart disease or myocardial infarction11. Conversely, conflicting results are presented in other studies, such as Sempos et al., which reported no evidence for an association between serum ferritin levels and the development of cardiovascular endpoints in a prospective cohort study12. Additionally, Knuiman et al. reported that there was insufficient evidence regarding the role of serum ferritin levels levels as a risk factor for cardiovascular disease in a longitudinal prospective cohort study in Western Australia13.

As observational studies can be impacted by confounding factors and reverse causality, contributing to the inconsistency of reported results, alternative approaches are required to better assess how and whether iron levels are linked to CVD and NDD risk, and to determine the most plausible direction of effect.

Genetic factors contribute to inter-individual in iron levels, and the overall heritability for iron has been reported as ranging from 14 to 32%14,15,16,17. Additionally, many CVD and NDD have a significant genetic component18,19,20. Exploring the impact of iron levels on CVD and NDD from a genetic perspective presents an opportunity to identify the mechanisms and biological pathways relating iron levels and complex disease risk. Such knowledge can be actionable given that iron levels can be regulated, up to a certain extent, through dietary interventions and supplements.

Iron metabolism varies by sex, with differing absorption rates and overall levels in the body21. Additionally, hormonal fluctuations throughout the menstrual cycle can impact iron absorption and regulation in females22. These variations in iron metabolism could play crucial roles in determining disease outcomes23.

We aimed to use large-scale genetic variant-trait association summary statistics and computational genetics methods to better understand the effect of genetically predicted serum iron levels on cardiovascular or neurodegenerative diseases, as well as the direction of these relationships. Given the established sex differences for iron levels and the diseases of interest, we also carried out complementary sex-stratified analyses.

Methods

Summary statistics for meta-analysis of serum iron levels

Genome-wide association study (GWAS) summary statistics for serum iron levels were obtained from the following two cohorts for meta-analysis: Baltimore Longitudinal Study of Aging (BLSA) study24, the Montreal Heart Institute (MHI) Biobank as well as from two previously published meta-analyses: Benyamin et al. and Moksnes et al. (Table 1)16,25. For the former meta-analysis, we used the summary statistics from their discovery analysis, which combined 11 cohorts that did not overlap with other cohorts used for our meta-analysis. The latter meta-analyzed three cohorts: the Trøndelag Health Study—HUNT, the Michigan Genomics Initiative (MGI), and SardiNIA26,27,28.

Table 1 Summary of cohorts of individuals of European-like genetic ancestry for meta-analysis or serum iron levels.

Additionally, we obtained sex-stratified summary statistics for serum iron from the following three cohorts for which such data were available: Montreal Heart Institute (MHI) Biobank, MGI Biobank and SardiNIA26,27.

All summary statistics for analysis had assessed individuals inferred to be genetically similar to a European genetic ancestry reference population, which we refer to as individuals with European-like genetic ancestry. Further details of the cohorts included can be found in Supplementary Methods.

Meta-analysis of serum iron levels summary statistics

We meta-analyzed GWAS summary statistics for serum iron to increase statistical power for our subsequent analyses. We used the sample-size weighting scheme in METAL to perform a fixed-effects meta-analysis29. The sample-size weighting scheme allows for the analysis of studies with different scales of measurements. All summary statistics included were derived from individuals of European-like genetic ancestry29. Autosomal SNPs and short indels were included, and chromosome X variants were included when available. We used the LiftOver tool to ensure that all summary statistics were on human genome build GRCh3730. A meta-analysis using sex-combined summary statistics was conducted by combining summary statistics from two previous meta-analyses (Benyamin et al.25 and Moksnes et al.16) and two other cohorts: the Baltimore Longitudinal Study of Aging (BLSA)24 and the Montreal Heart Institute (MHI) Biobank. We used FUMA (version 1.5.2) to annotate our iron meta-analysis results31.

GWAS summary statistics for cardiovascular- and neurodegenerative-related traits

We selected 11 large GWAS (> 20,000 cases for binary traits) conducted in individuals of European-like genetic ancestry (so to match the genetic ancestry of our iron meta-analysis) for neurodegenerative (Alzheimer’s disease32, Parkinson’s disease33, amyotrophic lateral sclerosis34), cardiovascular (coronary artery disease35, heart failure36), cerebrovascular (stroke37) and lipid (triglycerides, and the following cholesterol: total, low-density lipoprotein, high-density lipoprotein, non-high-density lipoprotein38) traits (Supplementary Fig. 2).

Global genetic correlations between iron and cardiovascular or neurodegenerative traits

We used cross-trait linkage disequilibrium score regression (LDSC; v1.0.1) to assess genome-wide genetic correlations (rg) between serum iron and each of the 11 traits of interest, based on common autosomal single nucleotide polymorphisms (SNPs)39. In brief, LDSC regresses the product of the marginal z-scores in the GWAS summary statistics for the two traits on the LD scores of each SNP. We processed the summary statistics for each trait through LDSC’s munge-sumstats.py script, which retained 1,217,312 common autosomal SNPs from the HapMap phase 3 project. Data from the European super-population of the 1000 Genomes phase 3 project was used for the LD reference panel (available at https://github.com/bulik/ldsc). Due to the long-range and complex linkage disequilibrium within the major histocompatibility complex region on chromosome 6, SNPs within this region are excluded.

Local genetic correlations between iron and cardiovascular or neurodegenerative traits

We used LAVA (Local Analysis of [co]Variant Association; v0.0.6) to estimate regional genetic correlations between iron and the 11 traits of interest. LAVA divides the autosomes into 2459 non-overlapping loci based on the European super-population of the 1000 Genomes Project as the linkage disequilibrium reference40. LAVA estimates local genetic correlation for each iron-trait pair at loci with genetic signal in both traits. Specifically, first, a univariate test was performed to identify which loci have genetic signal by testing for per-trait local heritability. Then, for loci where both traits had a p-value in the univariate test that passed Bonferroni correction (p < 0.05/2459), we then performed the bivariate (genetic correlation) analysis. We also applied a Bonferroni correction at the bivariate stage, reporting loci passing a p-value threshold for each iron-trait pair tested of 0.05/number of comparisons. Summary statistics for Alzheimer’s and Parkinson’s included proxy cases (individuals who have a close relative with the disease of interest, but who do not have the disease themselves). For comparison, we also carried out a sensitivity analysis using summary statistics that excluded proxy cases for Alzheimer’s41 and Parkinson’s33, which have lower sample sizes (N cases = 21,982 and N controls = 41,944 for Alzheimer’s, and N cases = 17,996 and N controls = 16,502 for Parkinson’s).

Causal inference approaches (Mendelian randomization) to elucidate links between iron and cardiovascular or neurodegenerative traits

Mendelian randomization (MR) is an epidemiological instrumental variable approach that uses genetic variant-trait associations to assess causal relationships between genetically predicted values of an exposure and an outcome, given certain assumptions. We carried out a series of two-sample (MR) analyses with iron as the exposure and one of the 11 selected traits as the outcome. There are three core assumptions that must be met for the causal inference to be valid. First, the genetic variants selected as genetic instruments are robustly associated with the exposure. Second, the genetic instruments are not associated with confounding factors, measured or unmeasured. Finally, the relationship between the genetic instruments and the outcome is only through the exposure. The first assumption can be quantitatively evaluated by calculating the F statistic for the selected genetic instrument and the exposure, where an F statistic greater than 10 is standardly considered a robust association. There is no quantitative test per se to ensure the validity of the second and third assumptions, but sensitivity analyses, including those that assess the presence of horizontal pleiotropy, such as MR-Egger, are commonly conducted to ensure the consistency of results.

We verified that the selected genetic variants do not exhibit evidence of association with confounding traits (see “Results” section for details) or with the tested outcomes by searching in the GWAS Catalog and in PhenoScanner (v2)42,43. Variants that did exhibit evidence were excluded.

We used the R package TwoSampleMR (version 0.5.7) to carry out two-sample MR analyses44. For our primary analysis, we used the inverse variance weighted approach with random effects, followed by MR Egger, weighted median and weighted mode as secondary analyses to assess the consistency of the results. We report the Cochran’s Q test for heterogeneity in causal estimates for the inverse variance weighted method across the genetic variants used as instruments. We performed the MR Egger intercept test to assess the presence of pleiotropic variants45. We also performed MR-PRESSO (version 1.0) using the R package MR-PRESSO to identify and remove horizontal pleiotropic outliers from our MR analyses46, as well as a leave-one-out analysis to evaluate whether the results were influenced by a specific genetic instrument. Finally, for each iron-outcome pair tested, a directionality test was performed using TwoSampleMR to assess the validity of the assumption that the exposure (genetically predicted iron levels) causes the outcome rather than the other way around.

Sex-stratified analyses

We also performed a sex-stratified meta-analysis following a similar protocol as the sex-combined meta-analysis, using data from the three cohorts for which we had access to sex-stratified summary statistics (SardiNIA, Montreal Heart Institute (MHI) Biobank, and Michigan Genomics Initiative (MGI) Biobank). The total sample size tested at each genetic variant was up to 11,547 females and 9090 males. Then, we estimated global and local genetic correlations, and finally, we performed MR using the same workflow as previously described with the sex-stratified iron meta-analysis summary statistics and sex-specific summary statistics from the following 3 conditions with large-scale sex-stratified summary statistics available from European-like study samples: Parkinson’s disease33, coronary artery disease35 and amyotrophic lateral sclerosis47 (summary statistics provided by co-authors RPB and RLM). As in the sex-combined summary statistics, the summary statistics for all studies used here were derived from individuals of European-like genetic ancestry.

Results

Meta-analysis of iron summary statistics

In our meta-analysis of serum iron GWAS studies, we identified a total of 23 genome-wide significant (p-value < 5 × 10–8) loci (Fig. 1, Table 2). No significant associations were identified on the X chromosome, but sample sizes available for testing variants on this chromosome were limited as the Benyamin et al. and BLSA studies included in our meta-analysis did not provide association statistics for chromosome X variants. The most significant associations were observed on chromosome 22 at TMPRSS6 (p-value = 2.17 × 10–375), and on chromosome 6 at HFE (p-value = 2.23 × 10–375). 20 of the 23 loci had been previously identified at genome-wide significance in at least one of the GWAS studies included in our meta-analysis. The three loci that had not been previously identified (corresponding to closest genes, TFRC, PDGFRA and NEO1) were all suggestive signals in the Moksnes et al.16 serum iron meta-analysis (p < 2 × 10–6). As expected for a complex trait, the most significant variants fell into intergenic or intronic regions (Supplementary Fig. 3).

Fig. 1
figure 1

Manhattan plot of the results from the GWAS meta-analysis of serum iron using a sample-size weighted z-score method resulting in a total sample size of 263,167 individuals of European-like genetic ancestry. The dashed line presents the threshold to denote genome-wide significance (5 × 10–8). We replaced extremely significant values in our analysis with a minimum non-zero value that plotting libraries can handle. Note rescaling on the y-axis represented by two horizontal lines (“=”).

Table 2 Summary of top hits in meta-analysis of serum iron GWAS.

The estimated heritability of serum iron levels on the observed scale using LD score regression based on the 1000 Genomes European genetic ancestry super-population as the LD reference was estimated to be 0.0362 ± 0.008.

Global genetic correlations between iron and cardiovascular or neurodegenerative traits

We identified four significant correlations (p < 0.05) between serum iron and a tested trait (Supplementary Fig. 4). Specifically, we observed a significant positive genetic correlation between iron levels and HDL cholesterol. On the other hand, negative correlations were observed for iron and cardiovascular diseases (coronary artery disease and heart failure), as well as with triglycerides.

Local genetic correlations between iron and cardiovascular or neurodegenerative traits

Local genetic correlation can differ from global genetic correlation, i.e. from the genome-wide average correlation. For a pair of traits, if certain genomic loci are positively correlated, but other loci are negatively correlated, these effects could be canceled out in a global genetic correlation analysis. A published example of such a scenario is between HDL and LDL. Globally, these two traits are not strongly significantly correlated with each other, but locally, multiple regions are significantly correlated48. Additionally, global genetic correlation analysis can miss findings if strong genetic correlations are restricted to specific genomic regions49. We therefore assessed local/regional genetic correlations between iron and the eleven traits of interest. We identified a total of 194 significantly correlated loci (presented in Supplementary Table 1) between iron and the tested traits (when testing Alzheimer’s and Parkinson’s disease data that included proxies). The number of significant loci for each iron-trait pair tested ranged from one locus (between iron and either stroke or coronary artery disease) to 106 loci (between iron and Alzheimer’s disease) (Fig. 2a). To gain insight from the 106 loci significantly correlated between iron and Alzheimer’s disease, we carried out a gene-set enrichment analysis. Specifically, for each of the 106 loci, we obtained the list of protein coding genes falling into the genomic coordinates and used the Gene2Func function of FUMA (version 1.5.2) to identify enriched gene-sets using all protein coding genes as the background gene-set31. We applied a Bonferroni correction and required a minimum of 5 genes overlapping with a gene-set. Of the significantly enriched gene-sets, those related to metals homeostasis and cellular response, including for copper and zinc showed evidence of enrichment, as well as gene-sets related to fatty acids, oxidation by cytochrome P450 and eicosanoids (Supplementary Fig. 5). The enriched gene-set of eicosanoids (lipid-based signaling molecules involved in innate immune response)50 may suggest that the innate immune system could be a potential link between genetically predicted iron levels and Alzheimer’s disease. Indeed, multiple lines of evidence have highlighted the role of the immune system in Alzheimer’s disease, as previously reviewed51,52, and iron homeostasis is known to be tied to inflammatory response, which has also been previously reviewed53,54. When using the Alzheimer’s and Parkinson’s summary statistics excluding proxy cases, one locus remained significantly correlated between serum iron levels and Alzheimer’s and one locus remained significantly correlated between serum iron levels and Parkinson’s, likely due to limited power. Both those loci were also significant in the primary analysis including proxy cases, and there was agreement in the direction of effects.

Fig. 2
figure 2

Local genetic correlations between iron and NDD or CVD. Chromosomal visualization (created using PhenoGram78) of significantly correlated regions between iron and the 11 neurodegenerative or cardiovascular traits tested as listed in Supplementary Table 1 (a). Illustration of selected significant shared regional correlations on chromosome 6 (either just upstream or within the major histocompatibility complex) between serum iron and lipid traits tested (b). **Significant local correlation after Bonferroni correction; s: p-value < 0.05 but not significant at the Bonferroni correction threshold; ns non-significant (p-value > 0.05), AD Alzheimer’s disease, PD Parkinson’s disease, ALS amyotrophic lateral sclerosis, CAD coronary artery disease, HF heart failure, TC total cholesterol, TG triglycerides, HDL high-density lipoprotein cholesterol, LDL low-density lipoprotein cholesterol, nonHDL non-high-density lipoprotein cholesterol.

We noted that 58 (30%) of all significant loci were found between iron and a tested lipid trait (specifically, HDL cholesterol, LDL cholesterol, total cholesterol or triglycerides, corresponding to 36 unique genomic regions, 19 of which are situated within or nearby the major histocompatibility complex region on chromosome 6 (Fig. 2b). The major histocompatibility region is known for its complex long-range linkage disequilibrium patterns and for encoding a set of highly polymorphic genes that are key for immune response and immune system function55. This observation warrants further work on immunity and inflammation to assess the relationship between genetically predicted iron levels and lipid traits in downstream studies.

The locus (chr22:37,364,005–38,718,589, GRCh37) encompassing the genes most significantly associated with serum iron in our meta-analysis, TMPRSS6, was significantly correlated with triglycerides (rg = 0.370075, p-value = 9.63e-06). The local genetic correlation analyses did not show any correlation with any of the traits tested for locus on chromosome 6 with the second most significant signal (HFE).

Causal inference approaches (Mendelian randomization) to elucidate links between iron and cardiovascular or neurodegenerative traits

The completed MR-STROBE checklist is found in Supplementary Table 2.

To assess the validity of the three core MR assumptions, we selected independent genetic variants at genome-wide significant loci (p < 5 × 10–8). Through searches through the GWAS Catalog and PhenoScanner database, we also excluded variants with evidence of association with various traits that could be confounders acting on the exposure and the various outcomes tested. Such traits included cardiovascular diseases (including coronary artery disease, hypertension, and cardiomegaly), metabolic traits (including body mass index, body fat percentage), neurological disorders (including multiple sclerosis), inflammatory diseases (such as Crohn’s disease and rheumatoid arthritis), respiratory diseases (including asthma), and specific causes of mortality (such as malignant neoplasms and intestinal perforation). Additionally, variants identified as potential outliers via MR-PRESSO were removed from analyses. Finally, variants associated with the outcomes (cardiovascular and neurodegenerative traits) were not selected as genetic instruments.

Two-sample MR also assumes that the participants contributing to the summary statistics of the exposure are different from the participants contributing to the summary statistics of the outcome. To our knowledge, there should be minimal (if any) overlap for our analyses assessing iron levels as the exposure for each of the 11 outcomes.

Our primary analysis, using the inverse variance weighted approach, yielded evidence of a significant (p < 0.0045, accounting for eleven exposure-outcome comparisons) causal effect between genetically predicted iron levels and total cholesterol (OR [95% CI] = 0.94 [0.92–0.97], p = 3.3 × 10–6) and non-high-density lipoprotein cholesterol (OR [95% CI] = 0.95 [0.92–0.98], p = 3.9 × 10–3) (Fig. 3), suggesting a protective effect of iron. That is to say, modestly higher genetically predicted iron levels result in either modestly lower total cholesterol or non-high-density lipoprotein cholesterol. Secondary analyses using either the weighted median or weighted mode approach offered support for the results of the inverse variance weighted approach (p < 0.0045 and consistent direction of effect). The genetic instruments used in each analysis, including their corresponding F statistic (all greater than the accepted threshold of 10), and their corresponding effect sizes and standard errors in the exposure and outcome GWAS are presented in Supplementary Table 3. Results from the primary and secondary analyses are presented in Supplementary Table 4. Although MR Egger results did not reach statistical significance, the direction of effect (i.e. direction of the beta) was consistent, reinforcing the observed results.

Fig. 3
figure 3

Mendelian randomization results with genetically predicted iron levels as the exposure and each of the 11 neurodegenerative, cardiovascular or cerebrovascular traits tested as the outcome with inverse-variance weighted (IVW) approach as the primary analysis and MR Egger, weighted median and weighted mode as secondary analyses. AD Alzheimer’s disease, PD Parkinson’s disease, ALS amyotrophic lateral sclerosis, CAD coronary artery disease, HF heart failure, TC total cholesterol, TG triglycerides, HDL high-density lipoprotein cholesterol, LDL low-density lipoprotein cholesterol, nonHDL non-high-density lipoprotein cholesterol.

We constructed leave-one-out plots, scatterplots of effect sizes in the exposure versus the outcome and funnel plots for each of the exposure-outcome pairs tested (Supplementary Figs. 68).

The directionality tests suggested that for all iron-outcome pairs tested, the direction of iron as the exposure was most probable. Results of the directionality test are presented in Supplementary Table 5. Heterogeneity tests for the inverse variance weighted method suggest some heterogeneity for certain iron-outcome pairs tested, including for iron with non-HDL (Q = 25.7, p = 0.04), which may bias the causal estimates (Supplementary Table 6). None of the analyses, apart from low-density lipoprotein cholesterol (MR Egger intercept = -0.003, p = 0.04) showed evidence of horizontal pleiotropy as per the MR Egger intercept test (Supplementary Table 7).

Sex-stratified analyses

Our sex-stratified meta-analyses of serum iron using sex-stratified summary statistics from three cohorts for which sex-stratified results were available (Montreal Heart Institute (MHI) Biobank, Michigan Genomics Initiative (MGI) Biobank, and SardiNIA) yielded the same two strongest genome-wide significant signals as the overall meta-analysis, corresponding to the genes HFE and TMPRSS6 on chromosomes 6 and 22 respectively (Supplementary Fig. 9). Similar to the overall meta-analysis, there were no significant associations on the X chromosome in the sex-stratified analyses, but sample size was lower on this chromosome compared to the autosomes, as the SardiNIA cohort did not test X chromosome variants for association.

Global genetic correlations did not reveal statistically significant associations between iron levels and Parkinson’s disease, coronary artery disease, or amyotrophic lateral sclerosis for the sex-stratified analyses (Supplementary Fig. 10). However, our analysis of local genetic correlations identified significantly correlated loci for iron levels and these three diseases (Supplementary Figs. 1113, Supplementary Table 8). A locus on chromosome 12 (chr12:23,923,799–25,058,714, GRCh37) was significantly correlated between iron and Parkinson’s disease in females, whereas two loci were correlated in the corresponding male analysis (chr8: 98,132,520–99,388,944 and chr19:55,714,086–56,411,086). There were two loci significantly correlated between iron and coronary artery disease in the female analysis (chr19: 54,602,371–55,182,973 and chr20:42,185,672–43,008,890) and another two in the male analysis (chr6:110,647,843–111,363,443 and chr9:136,770,927–137,593,527). One locus on chromosome 9 (chr9:132,193,799–132,999,452) showed significant correlation in the male analysis between iron and amyotrophic lateral sclerosis and there were no significantly correlated loci in the corresponding female analysis. These sex-stratified local correlations also need to be replicated in larger sample sizes.

Finally, through sex-specific MR, we observed a potential near-significant causal effect of iron levels on Parkinson’s disease among females in our primary analysis using the inverse variance weighted approach (OR [95% CI] = 0.79 [0.64–0.98], p = 0.03). Direction of effect was consistent for the secondary analyses. Three genetic instruments were used in the stratified MR analyses (Supplementary Table 9). No significant causal effects were detected for coronary artery disease or amyotrophic lateral sclerosis (Supplementary Figs. 1417, Supplementary Table 10). Directionality test results suggested that iron as the exposure (i.e. not the outcome) was most likely direction (Supplementary Table 11). We observed neither evidence of heterogeneity through heterogeneity test results from the inverse variance weighted approach (Supplementary Table 12) nor the presence of horizontal pleiotropy through the MR Egger intercept test (Supplementary Table 13).

Discussion

Serum iron is important for various bodily functions, including those involved in the cardiovascular and neurological systems. We aimed to better understand the contribution of genetically predicted serum iron levels on cardiovascular and neurodegenerative diseases by applying computational approaches to large-scale GWAS summary statistics. To accomplish this aim, we meta-analyzed iron studies to increase statistical power and used eleven large cardiovascular- or neurodegenerative-related GWAS results. We supported our primary analyses with sex-stratified analyses, data permitting, given the established sex differences in iron levels and the various traits tested.

In summary, our meta-analysis of serum iron studies identified 20 genome-wide significant loci previously reported and unveiled 3 novel loci (which were subthreshold significant in a previous meta-analysis, thus highlighting the importance of increasing sample size). Further investigation into these loci may reveal novel biological mechanisms underlying iron homeostasis.

The results of our global genetic correlations analyses underscore a complex interplay between serum iron levels and lipid profiles, with a significant positive genetic correlation observed between iron levels and HDL cholesterol. This finding suggests that higher, but within the normal range, genetically predicted iron levels is correlated with a possibly beneficial lipid profile, since HDL cholesterol is generally associated with reduced risk of cardiovascular disease56,57. Negative genetic correlations were observed between iron and the following traits: coronary artery disease, heart failure and triglyceride levels, which could suggest that iron plays a protective role that contributes to improved cardiovascular health.

Local genetic correlations analysis further elucidated the genetic architecture underlying the relationship between iron and neurodegenerative and cardiovascular traits. Differences in the number of significant regions/loci identified could also reflect differences in study sample size, statistical power, or the prevalence of these conditions in the populations studied. The largest number of associated loci (n = 106) between iron and Alzheimer’s disease could suggest a genetic connection between iron levels and the risk of this neurodegenerative condition. However, correlation does not imply causation, nor does it inform on direction of effect. Causal inference analysis, through Mendelian randomization (MR), did not yield significant findings between iron levels and Alzheimer’s disease, but the directionality test pointed to iron levels to Alzheimer’s as the most likely direction.

MR analyses revealed a potential positive causal effect of iron levels on lipid profile traits, notably total, LDL and non-HDL cholesterol, suggesting a possible beneficial effect of higher iron levels on lipid profiles. However, the presence of heterogeneity for certain iron-outcome pairs, such as in the analysis of iron with non-HDL cholesterol, may have biased the causal estimates.

Experimental and clinical findings concerning sex-related differences in iron metabolism for certain cardiovascular58,59 and neurodegenerative21 traits are receiving increased attention. Females, particularly those of childbearing age, generally have a higher prevalence of iron deficiency and anemia than males23,60. In contrast, while menstruating females may have lower iron stores, males are more likely to be iron overloaded61,62.

In addition, sex hormones play a crucial role in regulating iron homeostasis. Estrogen and progesterone influence iron regulation through hepcidin activity, thus contributing to variations in iron levels throughout the menstrual cycle22. Clinical and experimental evidence suggests that high estrogen levels correlate with increased systemic iron availability63. Conversely, very low testosterone levels can be observed in cases of severe iron overload64, while increased testosterone levels have been found to increase dietary iron absorption and body iron stores by inhibiting hepcidin synthesis65.

These differences in iron metabolism underline the importance of carrying out sex-stratified analyses to allow for the discovery of tailored interventions and treatments. Our sex-specific MR analysis suggested a potential causal effect of iron levels on Parkinson’s disease in females, indicating that higher genetically predicted iron levels within the normal range could play a protective role against the development of Parkinson’s in females. Results in males were insignificant, but further analyses in larger sample sizes should be performed to assess whether differences in statistical power can explain the differing results in the sex-stratified analyses.

Parkinson’s is a progressive neurodegenerative disease, manifesting as motor and non-motor symptoms marked by neuronal loss in the pars compacta of the substantia nigra66. Despite extensive research, the primary disease mechanisms remain unclear, although several theories have been proposed, including mitochondrial dysfunction, inflammation, abnormalities in protein management and oxidative stress67. Various factors contributing to oxidative stress in Parkinson’s have been investigated, including aging, neuroinflammation and iron overload68. Animal studies using neurotoxin models of Parkinson’s disease have demonstrated that excess iron levels in the brain can contribute to loss of dopaminergic neurons, a hallmark of the disease69. Sex-differing factors in Parkinson’s disease epidemiology and clinical progression have been previously reported70,71,72. There may be neuroprotective effects of estrogen in females, delaying Parkinson’s disease age of onset73, and other work has reported that estrogen treatment can improve motor and cognitive function in females with Parkinson’s74.

We acknowledge limitations of our work. First, all cohorts assessed were of European-like genetic ancestry, which does not represent the world’s diversity. It is imperative to include diverse sets of participants, as data become available to allow for the generalizability of findings.

Second, we acknowledge the importance of sex-stratified analyses given sex-biases in iron levels as well as the neurodegenerative and cardiovascular traits of interest. Sex, as a biological variable, exerts a wide range of effects on genome function and resulting phenotypes, and these effects can be mediated by a variety of considerations including sex-differing gene expression or differing hormonal levels. However, sample sizes in our sex-stratified analyses were limited due to lack of availability of sex-stratified results. Increased sample sizes in these analyses may reveal sex-biased effects for which we did not have sufficient power to detect in the current analyses. On a similar note, chromosome X was not assessed in all studies included in our analyses, leading to less power to detect findings on that chromosome compared to the autosomes.

Third, research has indicated systemic iron status may not correlate well with either brain or cardiac iron levels75,76,77. Further investigations are needed to understand the role of iron in cardiovascular and neurological health, accounting for both systemic and organ-specific levels.

Another limitation pertains to the variation in iron status across different age groups and life stages. For instance, iron levels are influenced by the characteristics of female life stages, such as prepuberty, post-puberty, pregnancy, breastfeeding and menopause. Failure to control for these factors is a limitation, and incorporating information on life stage in future work when such data are available at-scale will enhance our understanding of the impact of genetically predicted serum iron levels on disease risk.

What is more, we note that iron levels of participants in the cohorts included in our meta-analysis tended to be lower than or within the normal range for serum iron levels (see Table 1). Future work assessing individuals with iron levels above the normal range may increase power.

In future investigations, our understanding can be broadened by examining additional iron biomarkers, such as transferrin and ferritin, alongside serum iron, to gain additional insight of the role of genetically predicted iron distribution in the body on cardiovascular and neurological health. Moreover, as data become available, longitudinal analyses to uncover the temporal relationship between iron dysregulation and the onset and progression of cardiovascular and neurodegenerative diseases should be pursued.

We also encourage exploration into other substances, like copper and magnesium, which share metabolic pathways with iron, for their interactions and associations with cardiovascular and neurodegenerative disease risk, potentially informing dietary or supplemental interventions.

In conclusion, our findings underscore potential links between genetically predicted serum iron levels in cardiovascular and neurodegenerative diseases, warranting further research to elucidate causal mechanisms and explore preventive and therapeutic strategies, including the consideration of sex-differing differences. While our findings presented here provide initial insights, future studies incorporating larger sample sizes, other types of genetic variation in addition to SNPs, are needed to replicate and validate these findings. Additionally, integrating data on hormonal status and increasing genetic diversity of the analysis cohorts will be key to provide insights into effects that may differ by sex or genetic ancestry.