Introduction

Long-term cannabis use has been linked to a number of health problems, including respiratory problems, cardiovascular diseases, cognitive decline, and mental illness [1]. Although some associations are inconsistent, evidence suggests that exposure to cannabinoids such as delta-9-tetrahydrocannabinol (THC) may pose risks to health. For example, cannabis administration studies have shown that THC, the main psychoactive constituent of cannabis, has acute effects on the vasculature [2] and on mental health symptoms [3]. Moreover, it is well known that smoking has negative effects on various aspects of health, and most people who use cannabis smoke it [4, 5]. These findings support the biological plausibility of effects of long-term cannabis use on some aspects of health.

Epigenetic responses to substance use, which can influence gene expression without altering genetic sequence, are increasingly recognized as mechanisms linking substance use to adverse health outcomes [6]. DNA methylation is one epigenetic mechanism, and it involves the addition or removal of methyl groups to DNA, usually at cytosine-guanine dinucleotides (CpG sites) [7]. Of all the substances, tobacco smoking has most consistently been associated with differential DNA methylation, notably hypomethylation of a CpG site located in the aryl hydrocarbon receptor repressor (AHRR) gene -- cg05575921 [8]. Hypomethylation of this site has been found in some studies to predict the development of lung cancer [9]. Importantly, hypomethylation of cg05575921 is linked to tobacco smoking and not non-combustible tobacco use [8], suggesting that it is exposure to the harmful chemicals in smoke, as opposed to nicotine, that causes the hypomethylation. The inhaled combustion products for cannabis and tobacco are largely similar [10], raising the question: do people who smoke cannabis show similar patterns of differential methylation to those who smoke tobacco?

The DNA methylation profiles of tobacco users have been fairly well characterized, but few studies have reported on the DNA methylation profiles of cannabis users. This represents a significant knowledge gap, especially given cannabis legalization and increases in cannabis use [11,12,13,14,15]. We identified eight studies published by August 2024 that reported on cannabis-related DNA methylation for up to eight hundred and fifty thousand CpG sites (Table S1) [16,17,18,19,20,21,22,23]. The studies generally found that cannabis use was associated with differential methylation of a small number of CpG sites after adjusting for multiple testing. However, the studies disagreed on which sites and on whether tobacco smoking accounted for cannabis associations. Moreover, some studies relied on small, unrepresentative samples. Most studies used relatively crude measures of cannabis use. For example, two studies distinguished only between those who had ever used versus never used cannabis [17, 23]. No studies examined whether cannabis cessation could restore DNA methylation profiles to that of non-users.

We aimed to redress these limitations in a representative birth cohort followed prospectively to midlife. We capitalized on the study’s strong and repeated assessments of cannabis use over a span of thirty years to identify, at midlife, a group of long-term cannabis users. Long-term cannabis users are of special clinical interest because long-term cannabis use is associated with more health, cognitive, and social problems compared with occasional, recreational use [1, 24,25,26,27]. Moreover, midlife is of interest because it is the developmental period when physical health decline becomes apparent [28] and when chronic diseases tend to onset [29].

This study addressed five questions. First, do previously reported cannabis-methylation associations replicate in a group of middle-age long-term cannabis users from an independent cohort? We elected a replication approach, as opposed to conducting an exploratory epigenome wide association study (EWAS), because EWAS requires very large samples to detect what are likely small-effect associations across hundreds of thousands of sites. We selected for analysis a set of 246 previously published cannabis-associated CpG sites (Table S2).

Second, how do long-term cannabis users compare with lifelong non-users and long-term tobacco users? We compared long-term cannabis users not only with non-users but also with long-term tobacco users, because many people who use cannabis also smoke tobacco and because the methylation profiles of long-term tobacco users are well characterized. Therefore, tobacco findings provide important context for interpreting cannabis findings. Analyses took two complementary approaches: (1) group comparisons of long-term cannabis users with lifelong non-users and long-term tobacco users, and (2) tests of dose-response associations, using, as the exposures, continuously-distributed quantitative measures of the extent of persistence of regular cannabis use and persistence of tobacco dependence over a lifetime. We elected to use a group comparison approach because it is the approach used in case-control studies. We elected to test dose-response associations because these tests are statistically powerful, and dose-response associations would be expected if associations were causal. Consistent findings across the two approaches would bolster confidence in cannabis-methylation associations.

Third, is differential DNA methylation among long-term cannabis users robust to control for confounders? We selected a set of plausible confounders and included them as covariates in tests of dose-response associations: childhood socioeconomic deprivation; low childhood self-control; family history of substance dependence; and persistence of tobacco, alcohol, and illicit drug use.

Fourth, do any of the cannabis-related methylation markers represent meaningful biological associations? Analyses examined correlations between cannabis-related methylation marker levels and whole-genome gene expression. Fifth, does cannabis cessation reverse differences in DNA methylation profiles? Analyses compared cannabis and tobacco quitters with non-users and with long-term cannabis users and long-term tobacco users. Our approach to addressing these questions is summarized in Fig. 1. Replicated findings of cannabis-related differential DNA methylation would suggest alterations in DNA methylation that could have important implications for later health.

Fig. 1
figure 1

Flowchart documenting approach to CpG site selection and data analysis.

Methods

Participants

Participants are members of the Dunedin Longitudinal Study, a representative birth cohort (N = 1,037; 91% of eligible births; 52% male) born April 1972-March 1973 in Dunedin, New Zealand (NZ), who were eligible based on residence in the province and who participated in the first assessment at age 3 years. As adults, the cohort’s distributions match the range in national surveys of same-age New Zealanders on educational attainment, body mass index, smoking, physical activity, and physician visits [30, 31]. Study participants are primarily of New Zealand European ethnicity; 8.6% reported Māori ethnicity at age 45.

Assessments were carried out at birth and ages 3, 5, 7, 9, 11, 13, 15, 18, 21, 26, 32, 38, and most recently (completed April 2019) 45 years. Of the original 1037 cohort participants, 997 were still alive at age 45; of these, 938 (94.1%) participated in the age-45 assessments [474 men (50.5%)]. Participants assessed at age 45 did not differ significantly from other living participants in terms of childhood SES, childhood IQ, or history of psychopathology (Fig. S1).

Ethics approval and consent to participate

Participants gave written informed consent. Study protocols were approved by the NZ Health and Disability Ethics Committee (Protocol: 2017-1211). All methods were performed in accordance with the relevant guidelines and regulations.

Measures

Measures are described here and more details are shown in Table S3.

Age-45 DNA methylation

At age 45, DNA extracted from peripheral whole blood was assayed using the EPIC 850 K BeadChip. Analyses are restricted to cohort members who did not self-identify as Māori at the time of the first blood draw and who consented to phlebotomy and whose DNA methylation passed quality control (n = 818).

As previously described [32], whole blood was collected in 10 mL K2EDTA tubes from 90% (N = 824) of participants at age 45. DNA was extracted from the buffy coat using standard procedures [33, 34]. ~500 ng of DNA from each sample was treated with sodium bisulfite using the EZ-96 DNA Methylation kit (Zymo Research, CA, USA). DNA methylation was quantified using the Illumina Infinium HumanMethylationEPIC BeadChip (“Illumina EPIC BeadChip”) run on an Illumina iScan System (Illumina, CA, USA) at the Molecular Genomics Core at the Duke Molecular Physiology Institute. All sample processing and array profiling was performed as a single experiment (i.e. at the same time).

Data were processed using ‘wateRmelon’ (v1.26.0; [35]) and were normalized using the ‘methylumi’ (v2.30.0; [36]) Bioconductor package from the R statistical programming environment and subjected to quality control (QC) analyses. Samples were removed if the average detection p-value was >=0.05. To confirm genetic identity of the DNA samples, we assessed genotype concordance between SNP probes on the EPIC array and data generated using Illumina OmniExpress12v1.1 genotyping BeadChips. Principal components analysis was performed on the full, normalized dataset and the first two components plotted [37]. Samples formed two major clusters separating on the 1st component, which corresponded to recorded sex. This was used to confirm sex assignment. Samples from 818 age-45 participants passed our QC pipeline.

To permit control for technical variation, we used residualized DNA methylation values for the principal components (PCs) using DNA methylation beta values for the 186 QC probes on the EPIC BeadChip. A total of 32 PCs were retained explaining 90% of the variation in QC probes. To control for cell type composition, we also included white cell-type counts measured using flow cytometry (Sysmex Corporation, Japan) in whole blood samples taken concurrently with the DNA sample.

Substance use

At ages 18, 21, 26, 32, 38, and 45, participants were interviewed about their substance use using the Diagnostic Interview Schedule [38, 39], and past-year substance-use dependencies were assessed following Diagnostic and Statistical Manual of Mental Disorders criteria. This information was used to define exposures for group comparisons and for tests of dose-response associations.

Groups for comparison

Groups comprised cohort members who met criteria at age 45 for (i) long-term cannabis use, (ii) long-term tobacco use, (ii) lifelong non-use of cannabis and tobacco, (iv) former cannabis use, (v) former tobacco use (Fig. S2). Subsets of cohort members who met criteria for these pre-registered groups were selected as described below and as previously published [24, 40, 41] to mimic groups from case-control studies. If a cohort member did not meet criteria for a group, they were not included in group comparisons (n = 332), but they were included in tests of dose-response associations. Table S4 shows background characteristics and substance use for the cohort and for each group described below.

Long-term cannabis users (n = 74; 68% male) used cannabis weekly or more frequently in the past year at age 45, or were dependent on cannabis at age 45, and also used weekly or more frequently at one or more previous assessment waves. Of these, 28% used cannabis before age 18; 74% met criteria for cannabis dependence at one or more waves; and 89% used regularly (4+ days per week) at one or more waves. Age-45 cannabis consumption was a median of 300 days in the past year, with 62% using 4+ days per week. Most long-term cannabis users had a history of tobacco dependence (84%) and smoked tobacco daily at age 45 (66%).

Long-term tobacco users (n = 57; 40% male) smoked tobacco daily at age 45 and also smoked daily at one or more previous waves; were free from cannabis at age 45; and had no history of weekly cannabis use or dependence.

Lifelong cannabis/tobacco non-users (n = 189; 40% male) never used cannabis, never used tobacco daily, and never had a diagnosis of any substance-use disorder.

Cannabis quitters (n = 50; 60% male) did not use cannabis at age 45 but previously either were diagnosed with cannabis dependence or used cannabis regularly (4+ days per week). Most had a history of tobacco dependence (60%) and a quarter smoked tobacco daily at age 45.

Tobacco quitters (n = 148; 45% male) did not use tobacco at age 45 but were previously diagnosed with tobacco dependence. Nearly a third of tobacco quitters had a history of cannabis dependence (29%), but few of them used cannabis regularly at age 45 (5%).

Quantitative exposures for tests of dose-response associations

We report on two previously published quantitative exposures [24, 40]: persistence of regular cannabis use and persistence of tobacco dependence.

Persistence of regular cannabis use (i.e., 4+ days per week) comprised participants who (i) never used cannabis (n = 244), (ii) used but never regularly (n = 450), (iii) used regularly at one study wave from age 18–45 years (n = 44), (iv) two waves (n = 26), (v) three waves (n = 26), and (vi) 4+ waves (n = 27).

Persistence of tobacco dependence comprised participants who (i) never smoked tobacco (n = 411), (ii) smoked tobacco daily at one or more assessment waves but were never diagnosed with tobacco dependence (n = 114), (iii) were diagnosed at one wave (n = 92), (iv) two waves (n = 81), (v) three waves (n = 52), and (vi) four or more waves (n = 67).

Covariates

The following covariates were included in all statistical tests: sex, methylation array control probe principal components indexing technical variation, and white blood cell counts. White blood cell counts (neutrophils, lymphocytes, monocytes, eosinophils, and basophils) were included because there can be differences in patterns of methylation between different cell types, and people differ in the distribution of these cells in blood. The following additional covariates were included in tests of dose-response associations due to their association with cannabis use and/or DNA methylation: childhood socioeconomic status [42], low childhood self-control [43], family history of substance dependence [44], and persistent use of other substances (tobacco, alcohol, other illicit drugs) [45, 46].

Statistical analyses

DNA methylation data were not normally distributed and showed evidence of outliers, influential observations, and, for some methylation markers, non-normally distributed residuals. Robust regression is appropriate when data contain outliers and influential observations. Further, research has shown that non-normally distributed residuals do not bias epigenome-wide association findings in terms of false negatives or false positives [47].

We used robust regression to compare groups on age-45 CpG sites, with groups represented by dummy codes. We used robust regression to test dose-response associations between quantitative exposures (persistence of regular cannabis use, persistence of tobacco dependence) and age-45 CpG sites. In tests of dose-response associations, covariates were added sequentially: sex, methylation array control probe principal components, and white blood cell counts (Model 1); aforementioned covariates plus childhood SES, low childhood self-control, and family history of substance dependence (Model 2); and aforementioned covariates plus persistent use of other substances (Model 3).

Prior to analyses, methylation probes were standardized on the full age-45 cohort (N = 818, M = 0.00, SD = 1.00). Standardized mean differences between groups can be interpreted as effect sizes, with 0.2, 0.5, 0.8 reflecting small, medium, and large effects, respectively [48]. Results present nominal p-values and indicate which CpG sites survived Bonferroni correction. All statistical tests are two-sided.

Results

A comparison of long-term cannabis users, long-term tobacco users, and non-users

Groups were compared on the 246 CpG sites shown in previous studies to be associated with cannabis use.

Long-term cannabis users showed both hypomethylation and hypermethylation at age 45 relative to lifelong cannabis/tobacco non-users, but mostly hypomethylation (Fig. 2A). Of 246 CpG sites, 35 differed between long-term cannabis users and non-users at α = 0.05, and 17 at α adjusted for 246 tests (Table 1).

Fig. 2: A comparison of long-term cannabis users, long-term tobacco users, and lifelong cannabis/tobacco non-users on age-45 DNA methylation.
figure 2

The x axis is the mean difference between long-term cannabis users and cannabis/tobacco non-users (Panel A), long-term tobacco users and cannabis/tobacco non-users (Panel B), and long-term cannabis users and long-term tobacco users (Panel C) on the 246 CpG sites at age 45. Mean differences were estimated using robust regression and were adjusted for sex, control probe principal components indexing technical variation, and white blood cell counts. Negative mean differences indicate hypomethylation relative to the comparison group, and positive mean differences indicate hypermethylation. The y-axis is the negative log base 10 p-value, and these were truncated at a value of 100. The dashed vertical line is the reference line that represents no difference in means between groups. Panel A=Long-term Cannabis Users (N = 73) vs. Cannabis/Tobacco Non-users (N = 182). Panel B=Long-term Tobacco Users (N = 56) vs. Cannabis/Tobacco Non-users (N = 182). Panel C=Long-term Cannabis Users (N = 73) vs. Long-term Tobacco Users (N = 56). The colors indicate statistical significance: Blue= p < 0.00020 (significant at Bonferroni-adjusted threshold for 246 tests). Red= p < 0.05. Gray= p ≥ 0.05.

Table 1 Comparison of long-term cannabis users, long-term tobacco users, and cannabis/tobacco non-users on age-45 DNA methylation.

Long-term tobacco users showed both hypomethylation and hypermethylation at age 45 compared with lifelong cannabis/tobacco non-users, but mostly hypomethylation (Fig. 2B). Of 246 CpG sites, 36 differed between long-term tobacco users and non-users at α = 0.05, and 17 at α adjusted for 246 tests (Table 1).

Long-term cannabis users and long-term tobacco users generally did not differ in terms of age-45 DNA methylation (Fig. 2C). Of 246 CpG sites, 17 differed between long-term cannabis users and long-term tobacco users at α = 0.05, and 3 at α adjusted for 246 tests (Table 1).

Tests of dose-response associations for persistence of regular cannabis use and persistence of tobacco dependence

Greater persistence of regular cannabis use from age 18–45 was associated with differential DNA methylation of 52 of 246 CpG sites at α = 0.05, and 17 at α adjusted for 246 tests (Table 2, Model 1). Associations were largely unchanged after adjusting for childhood SES, low childhood self-control, and family history of substance dependence, with 45 of the 52 significant CpG sites from Model 1 remaining differentially methylated at α = 0.05, and 15 at α adjusted for 52 tests (Table 2, Model 2). A number of associations decreased substantially after additionally adjusting for persistence of tobacco, alcohol, and illicit drug use. Nonetheless, 19 of the 45 significant CpG sites from Model 2 remained differentially methylated at α = 0.05 after adjusting for all covariates, and 9 of these were significant at α adjusted for 45 tests (Table 2, Model 3).

Table 2 Dose-response associations between persistence of regular cannabis use from age 18–45 and age-45 DNA methylation.

Greater persistence of tobacco dependence was associated with differential methylation of 57 of the 246 CpG sites at α = 0.05, and 22 at α adjusted for 246 tests (Table 3, Model 1). Associations were largely unchanged after adjusting for childhood SES, low childhood self-control, and family history of substance dependence, with 47 of the 57 significant CpG sites from Model 1 remaining differentially methylated at α = 0.05, and 22 at α adjusted for 57 tests (Table 3, Model 2). Associations were only slightly attenuated after adjusting for persistence of cannabis, alcohol, and illicit drug use, with 37 of the 47 significant CpG sites from Model 2 remaining differentially methylated at α = 0.05 after adjustment for all covariates, and 20 of these were significant at α adjusted for 47 tests (Table 3, Model 3).

Table 3 Dose-response associations between persistence of tobacco dependence from age 18–45 and age-45 DNA methylation.

Summary of group comparisons and tests of dose-response associations

Table S5 summarizes the cannabis-related CpGs from tests of group comparisons and tests of dose-response associations. Of the 246 candidate CpG sites that we sought to replicate, 35 replicated in tests comparing long-term cannabis users with non-users and 52 replicated in tests of persistence of regular cannabis use, using comparable covariates across the two approaches (sex, control probe principal components indexing technical variation, white blood cell counts) and using α = 0.05. After adjusting for multiple testing, those numbers reduced to 17 (Table 1) and 17 (Table 2, Model 1) CpG sites, respectively, with 16 CpGs in common across the two approaches. When dose-response associations were additionally adjusted for substantive covariates, including childhood SES, low childhood self-control, family history of substance dependence, and persistence of tobacco, alcohol, and illicit drug use, nine CpGs remained associated with persistence of regular cannabis use at α adjusted for multiple testing (Table 2, Model 3: cg05575921, cg21566642, cg03636183, cg21161138, cg01940273, cg17739917, cg05086879, cg02978227, cg23079012). By comparison, there were 17 tobacco-related CpG sites that survived adjustment for multiple testing in group comparisons and were robust to covariate-adjustment and adjustment for multiple testing in tests of dose-response associations. The 9 cannabis-related CpG sites that replicated in tests of group comparisons and covariate-adjusted tests of dose-response associations were the focus of the next two analyses: (1) a test of associations between methylation sites and gene expression, and (2) a test of whether quitters showed less extreme hypomethylation than long-term cannabis users.

Association between DNA methylation and gene expression levels

To test whether the nine cannabis-related methylation sites represent meaningful biological associations, we examined correlations between methylation site levels and whole-genome gene expression (N = 45,374 probesets). Methods are described in Supplemental Text. Gene expression was measured when study members were 38 years old. Therefore, these analyses used gene expression and DNA methylation data collected concurrently at age 38. Because DNA methylation data were assayed using different arrays at age 38 and age 45, three of the nine cannabis-related CpG sites at age 45 were not present in the age-38 data. Of the remaining six CpG sites (cg05575921, cg21566642, cg03636183, cg21161138, cg01940273, cg23079012), all were associated with long-term cannabis use at age 38, like at age 45 (Table S6), and all showed statistically significant negative associations with gene expression levels after adjustment for multiple testing (Fig. S3). Specifically, across the six cannabis-related CpG sites, hypomethylation was associated with higher expression of AHRR, P2RY6, LRRN3, GPR15, and DSC2. Two sites were at the AHRR locus (cg05575921 and cg21161138).

A comparison of cannabis and tobacco quitters with non-users and long-term users

Cannabis quitters showed a pattern of DNA methylation that was intermediate between long-term cannabis users and lifelong non-users (Fig. 3A, Table S7).

Fig. 3: A comparison of cannabis and tobacco quitters with cannabis/tobacco non-users and long-term users.
figure 3

Panel A Cannabis quitters versus cannabis/tobacco non-users and long-term cannabis users. Panel B Tobacco quitters versus cannabis/tobacco non-users and long-term tobacco users. Means are crude means, standardized (M = 0.00, SD = 1.00) on the representative cohort. Therefore, means of 0.00 reflect the representative cohort norm. Error bars are 95% confidence intervals. Statistical tests used robust regression, with covariates (sex, principal components indexing technical variation, and white blood cell counts). Asterisks (*) indicate that the comparison with the quitter group was statistically significant at α adjusted for multiple testing (cannabis = 9 tests; tobacco = 17 tests).

Likewise, tobacco quitters showed a pattern of DNA methylation that was intermediate between long-term tobacco users and lifelong non-users (Fig. 3B, Table S8).

Further, longer cannabis quit length was dose-dependently associated with increases in methylation at age 45, after adjusting for all covariates (Table S9). Specifically, whereas long-term cannabis users who had not quit showed hypomethylation at age 45, former cannabis users showed less extreme hypomethylation with each additional increment in quit length (i.e., longer quit length was positively associated with methylation).

Discussion

This study characterized the DNA methylation profiles of long-term cannabis users in midlife, re-evaluating a set of 246 methylation sites shown in previous research to be associated with cannabis use. Key findings are as follows. First, 16 sites of cannabis-related differential DNA methylation were replicated across our comparisons of long-term cannabis-user versus non-user groups and our tests of dose-response associations, the latter of which used prospectively-assessed persistence of regular cannabis use from age 18–45 as the continuously-measured exposure. Second, when tests of dose-response associations were adjusted for a number of potential confounders (childhood socioeconomic deprivation; low childhood self-control; family history of substance dependence; and persistence of tobacco, alcohol, and illicit drug use), persistence of regular cannabis use remained robustly associated with hypomethylation of nine sites: cg05575921, cg21566642, cg03636183, cg21161138, cg01940273, cg17739917, cg05086879, cg02978227, cg23079012. Third, the nine sites were not specific to long-term cannabis users; long-term tobacco users showed hypomethylation of those same sites. Fourth, cannabis-related methylation sites showed biologically meaningful associations with gene expression. Fifth, the group of cannabis quitters showed less extreme DNA hypomethylation than long-term cannabis users, suggesting that cessation of cannabis use has the potential to reverse altered DNA methylation.

It is worth noting that most of the previously published markers of cannabis-associated differential DNA methylation were not replicated in this study. One factor might be our focus on cannabis use that is long-term, a difference from several prior studies that reported on less sustained use. However, our record can be compared to the literature where among the 246 previously published cannabis-related CpG sites, only one was reported by more than one study: cg05575921 [16, 20, 22] (Table S2). Another factor may be test-retest reliability. Our team previously documented the highly variable, but on average low, test-retest reliability of ~450,000 CpG sites assayed once using the 450 K BeadChip and once using the EPIC BeadChip, in the same DNA sample at the same time (mean intraclass correlation [ICC] = 0.21) [49]. Using that database (https://osf.io/83ucs/), we obtained the test-retest reliability of the 246 CpG sites that were the focus of the current study. Of the 246 sites, 156 had test-retest data because they were present on both the 450k and EPIC BeadChips, and their average ICC was only 0.28 (SD = 0.30, range = −0.13, 0.98). Sites that do not yield the same value if re-tested have limited utility. Importantly, the sites with better test-retest reliability showed the largest effect sizes in our tests of dose-response associations for persistence of regular cannabis use (r = 0.32, p < 0.0001) and persistence of tobacco dependence (r = 0.35, p < 0.0001). One possibility for future research, which may improve reproducibility, is to pre-screen for reliable CpG sites [49]. Another possibility is to use epigenetic measures that are enriched for reliable CpG sites [49]. For example, with one exception [50], several recent studies have shown that cannabis use is associated with accelerated epigenetic aging [51,52,53], using epigenetic clocks that correlate with physiological deterioration and a broad range of health indices.

Despite the relatively low replication rate, nine cannabis-related methylation sites were robust to adjustment for a range of covariates, and six of those sites with available gene expression data (cg05575921, cg21566642, cg03636183, cg21161138, cg01940273, cg23079012) were related to gene expression in a biologically meaningful way. Specifically, cannabis-related hypomethylation was associated with higher expression of the following genes: AHRR, P2RY6, LRRN3, GPR15, and DSC2. These genes are implicated in the metabolism of environmental toxins; immune system function and inflammatory response; and neurodevelopment, synaptogenesis, and neuroplasticity [54,55,56,57,58]. Higher expression is linked with some cancers (AHRR, DSC2) [59, 60], poorer prognosis of lung cancer (P2RY6) [55], and cardiovascular diseases (GPR15, DCS2) [61, 62]. Most of these genes have been implicated in tobacco smoking. Our findings are consistent with previous studies documenting that the cannabis-related methylation sites reported here are related to tobacco smoking and lung cancer (Table S10). However, unlike tobacco, cannabis use is not clearly associated with lung cancer [1] and is inconsistently associated with elevated risk of cardiovascular events [63].

The high degree of overlap between the methylation profiles of long-term cannabis users and long-term tobacco users has implications for the specific mechanisms underlying cannabis-related differential methylation. Although our findings showed that some associations between long-term cannabis use and DNA methylation were robust to adjustment for long-term tobacco use, it is difficult to disentangle cannabis effects from tobacco effects because most people who use cannabis also use tobacco. One possibility is that our analyses of long-term cannabis use did not fully account for tobacco effects, despite covariate adjustment, and tobacco use explains the cannabis-related differential methylation. However, another possibility is that cannabis and tobacco may independently affect DNA methylation. Importantly, one of the nine cannabis-associated sites from our study, cg05575921, has been shown to be associated specifically with tobacco smoking and not vaping or use of non-combustible tobacco products [8], which suggests smoking as a potential common mechanism. We found that long-term cannabis users and long-term tobacco users showed hypomethylation of cg05575921, but the effect was smaller for long-term cannabis users. The smaller effect for long-term cannabis users, and lack of a clear association between cannabis use and smoking-related health outcomes such as lung cancer, may be explained by the less intensive patterns of use typical of cannabis relative to tobacco. As cannabis use intensity increases [15], associations with lung cancer and other diseases could emerge.

A key question is whether THC exposure might also be a mechanism by which cannabis alters methylation. Rodent studies have shown that THC administration alters DNA methylation [64]. However, if THC were a mechanism underlying cannabis-related methylation in the present study, it is perhaps surprising that there were few methylation sites that were unique to cannabis. In this regard, it is worth noting that THC and nicotine affect shared brain pathways, including pathways that underlie reward, mood, and cognition. For example, despite binding to different brain receptors, THC and nicotine result in altered neurotransmission (e.g., increased dopamine release) [65], which could affect the epigenome in similar ways [66]. Moreover, the endocannabinoid and nicotinic systems appear to interact, with one system modulating the other [65]. Strikingly, epidemiological studies have documented that cannabis and tobacco relate similarly to a number of health outcomes, including psychosis [67], cognitive impairment [68], MRI-assessed brain structure [41], and periodontal disease [69]. This epidemiological evidence underscores the possibility that cannabis and tobacco use share biological pathways to health outcomes.

Quitting cannabis may at least partially reverse cannabis-related alterations to the epigenome. Cannabis quitters showed DNA methylation profiles that were intermediate between long-term cannabis users and non-users. Tobacco quitters showed the same pattern. The finding for tobacco has been shown before [70, 71] and is consistent with research documenting that tobacco cessation improves health outcomes [72]. The finding for cannabis is new and suggests that cannabis cessation may improve health outcomes.

This study has limitations. First, findings are based on a single New Zealand (NZ) cohort born in the 1970s when cannabis was much less potent (i.e., characterized by lower THC concentrations) than it is today [73, 74]. Relatedly, the heaviest cannabis users in the Dunedin cohort had used cannabis on an almost daily basis for years, but they comprised only ~6% of the cohort. Recent data from the US show that the number of daily cannabis users is increasing [15]. If long-term frequent use of high-potency cannabis underlies cannabis effects on methylation, then our study may have underestimated cannabis-methylation associations. In terms of how NZ currently compares with the US and Europe, medical cannabis is legal in NZ but non-medical use (i.e., recreational use) is not, similar to many American states. The NZ medical cannabis market has shifted toward high-THC products, and the potency of medical and illegal market cannabis products in NZ is similar to the US [75]. The rate of disability adjusted life years attributable to cannabis use disorder is higher in NZ than in the US and UK [76].

Second, the DNA is from blood, and research is needed to determine if findings replicate in other tissues (e.g., lung, which is obtained much more invasively than blood). Third, the study cannot address the mechanisms of cannabis-associated hypomethylation. Most people who use cannabis smoke it [4, 5], and although smoking is one plausible mechanism, research is needed to ascertain whether users who primarily vape or ingest cannabis also show differential DNA methylation.

Fourth, this paper focused on cannabis use. We included tobacco comparisons because tobacco associations provide important context for interpreting cannabis associations. However, there are tobacco-associated DNA methylation markers that we do not report on here, because we selected previously published cannabis-associated methylation markers for replication. Nonetheless, all 17 tobacco-associated markers of differential DNA methylation that replicated across tests of group comparisons and tests of dose-response associations are reported as tobacco-associated in the EWAS catalog in the same direction as we report here. Fifth, in group comparisons, small N’s may have reduced statistical power (n = 49–182). However, our dose-response tests had ample power (n = 787). Finally, this study is observational and cannot determine causality. Nonetheless, we found evidence of dose-response associations, which bolsters biological plausibility of a causal effect. Moreover, dose-response tests addressed confounding by adjusting for factors known to characterize long-term cannabis users: socioeconomically deprived circumstances, poor childhood self-control, family history of substance dependence, and persistent tobacco, alcohol, and illicit drug use. The field generally lacks tests of dose-response associations. Our findings represent a unique contribution.

This study has several implications. First, long-term cannabis use could affect DNA methylation in similar ways to tobacco, possibly at least partly though the shared mechanism of smoke inhalation. Second, research is needed to ascertain whether cannabis-related hypomethylation and associated alterations in gene expression result in the development of disease. Third, at a time when cannabis and cannabinoids are increasingly touted and used for salutary effects, it is important to recognize that the epigenome may be responsive to long-term cannabis use regardless of whether use is for medical or non-medical reasons. Finally, cannabis cessation could reverse altered DNA methylation.