Introduction

C-reacting Protein (CRP) is an acute-phase protein1. It is mainly produced by hepatocytes, but several studies indicate additional extrahepatic synthesis, e.g. in adipocytes or smooth muscle cells of the coronary arteries under inflammatory conditions2,3. Polymorphisms have been identified in the human CRP gene at locus 1q23.2 that are strongly associated with CRP values in human populations4. CRP as a member of the pentraxin family shows reactivity with ligands widely distributed in organisms, with inducing the complement system by interaction with the binding-sites5. Several inflammatory conditions such as rheumatoid arthritis, cardiovascular diseases, and infections show elevated expression of CRP6. Due to severe tissue damage like injury, infection or inflammation, CRP-concentration rises in a cytokine-mediated response within 24–72 h, while highest concentrations of CRP triggered by bacterial infections have been found in serum, increasing CRP values up to 1,000-fold1,7,8,. Plasma half-life of CRP is 18–20 h9.

CRP values in human populations are known to be influenced by age and sex, but also by factors such as lipid levels, blood pressure, body weight, estrogen (i.e., in hormone replacement therapy), nicotine- and alcohol consumption, which are in turn related to health behaviors, psychosocial stress and material living conditions and may potentially mediate the impact of socioeconomic factors on CRP1,4. Indicators of individual socioeconomic position (SEP), such as educational attainment and household income, are well-investigated determinants for diseases connected to inflammation and for inflammatory biomarkers including CRP10,11,12,13,14,15. For instance, Liu et al.16 have found that the influence of socioeconomic factors in childhood and adolescence plays a role for adult CRP values. In their study, an up to 25% elevated inflammatory level in the least advantaged group of participants (compared to the most advantaged) has been shown (24)16. In a meta-analysis, the association of educational degree and the complex regulatory inflammatory cascade trigger has been investigated with CRP showing the strongest social differences with higher CRP values in lower compared to higher educational groups, even after adjusting for health behaviors and body mass index14. Next to SEP playing an important role for the average values of CRP in blood serum of healthy subjects, variation in the CRP gene is a well-known determinant17. While in recent large scale genome-wide association studies (GWAS) up to 266 independent genetic loci have been reported to be associated with values of circulating CRP, genetic variants in the CRP gene locus continue to be among the strongest single genetic determinants of CRP values in humans18,19. It has been hypothesized that the incomplete penetrance of genetic variants of complex traits such as CRP values may at least partly be explained by gene by environment interactions20. For a range of complex traits that are associated with CRP values in human populations, gene by SEP interaction has been discussed10,21,22,23,24, with SEP serving as a context-defining indicator of social differences in non-genetic risk factors. While the association between low SEP and high CRP values has been observed in numerous studies and gene by SEP interactions have been shown for several inflammation-associated complex traits, the interaction between CRP locus genetic variation and SEP indicators has not been investigated to date.

The aim of the present study was to investigate to what extent the effect of allelic variation in the CRP gene single nucleotide polymorphism (SNP) rs4287174 on CRP values interacts with the SEP indicators education and income in a population-based study sample.

Methods

Study population

Baseline data of the Heinz Nixdorf Recall Study, a population-based prospective cohort, was used. The rationale and design of the study has been described in detail elsewhere25. In brief, 4,814 women and men between 45 and 74 years were randomly selected from mandatory registries of residence from the cities Bochum, Essen, and Mülheim/Ruhr within a large metropolitan region in the western part of Germany. The baseline response rate (recruitment period: December 2000 to June 2003) was 55.8% 26. From all participants written informed consent was obtained and the approval for the study was given by the ethics committee of the University of Duisburg-Essen in addition to extended quality management procedures and certification according to DIN ISO 9001:2000. The study adhered to the tenets of the declaration of Helsinki.

Indicators of socioeconomic position (SEP)

Income categories and educational attainment assessed at the baseline examination in standardized face-to-face interviews were used as SEP indicators. Education (total years theoretically spent in education combining the highest formal school degree and highest vocational training) was categorized according to the International Standard Classification of Education27. For statistical analysis, four educational groups were created with the lowest educational group of ≤ 10 years of formal education equivalent to a minimum compulsory school attendance and no additional vocational degree and the highest education group of ≥ 18 years equivalent to a vocational training including additional qualification or a university degree. Income category was converted in the monthly household equivalent income calculated by dividing the total household net income of the participants by a weighting factor for each household member. Four groups were defined for income based on sex-specific quartiles. To investigate the individual contribution of SEP indicators education and income, they were analyzed separately28,29.

Genetic data

SNP rs4287174 has been reported as lead variant in a recent 1000Genomes GWAS on chronic inflammation being in strong linkage disequilibrium (LD) with rs2794520 (r2 = 0.98), the lead variant at the CRP locus in other recent CRP GWAS18,19.Thus, rs4287174 was used as genetic marker for variation at the CRP locus in the study population with T coded as the CRP increasing allele (i.e., effect allele). The effect allele frequency in the analysis population was 0.65. Genotyping was performed using the Infinium Global Screening Array (GSA chip) by Illumina. No deviation from Hardy–Weinberg equilibrium was detected for rs4287174 genotypes.

C-reactive protein

Blood serum high-sensitivity CRP (mg/dl) was measured using a standardized assay (Roche Diagnostics, Basel, Switzerland) at the central laboratory of the University Hospital of Essen, Germany30. Blood serum samples were analyzed within 12 h after being collected and stored at 4 °C. Lower limit of detection was 0.015 mg/dl. Participants below the detection limit were set to 0.010 mg/dl.

Risk factors

Levels of total cholesterol were derived from blood serum samples by using standardized enzymatic methods and analyzed within 12 h after collection at the central laboratory of the University Hospital of Essen, Germany. Diabetes mellitus was defined as either of the following criteria: reported history of diabetes mellitus, taking glucose lowering drugs, having fasting blood glucose levels of > 125 mg/dL, or having nonfasting glucose levels of ≥ 200 mg/dL. Coronary artery calcification (CAC) was quantified at the baseline examination by noncontrast-enhanced electron beam computed tomography using a C-150 scanner (GE Imatron, South San Francisco, CA). A value of zero indicates nondetectable calcification. Current smoking was defined as smoking cigarettes during the last 12 months. Body mass index (BMI) was computed from standardized measurements of height and weight (kg/m2). Hypertension was defined as having a systolic value of ≥ 140 mm Hg or a diastolic value of ≥ 90 mm Hg or taking regular antihypertensive medication. Blood pressure was assessed using an automated oscillometric device (Omron HEM-705-CP) and calculated as the mean of the second and third values of 3 measurements. Physical activity was defined as no exercise versus exercise one and more times per week. Dietary intake was assessed by a validated 21-item food frequency questionnaire. Based on the food frequency questionnaire, a cardioprotective dietary pattern index was calculated to determine the quality of the participants’ diet (i.e., the higher the dietary pattern index score, the healthier the diet)23.

Statistical analysis

Out of the HNR-study cohort population (n = 4,814), participants with prevalent coronary heart diseases at baseline or missing information on prevalent coronary heart diseases (n = 342), as the main aim of the HNR study was the assessment of coronary artery calcification in a study sample free of coronary heart disease. Participants with missing data for SNP rs4287174 (n = 394) and for CRP values (n = 33) were also excluded from analysis (Fig. 1). Missing observations for education (n = 10), income (n = 263) and risk factors were excluded only from the respective analyses. Participants with prevalent coronary heart diseases at baseline or missing information on prevalent coronary heart diseases had moderately higher CRP values and income compared to the analysis population, while the distribution across educational groups did not differ (Suppl. Table 1). Participants with missing information on income were more likely to report low education and to be female, but did not differ in average CRP (Suppl. Table 2).

Fig. 1
figure 1

Participants of the Heinz Nixdorf Recall (HNR) study excluded from the analysis population (CHD: coronary heart disease; SNP: single nucleotide polymorphism; CRP: C-reactive Protein).

To characterize the analysis population (n = 4065), descriptive statistics were calculated for the whole analysis population as well as stratified by rs4287174 genotype. Age- and sex-adjusted linear regression models were fitted to quantify the associations of SEP indicators and SNP rs4287174 with CRP. A loge-transformation of (CRP + 1) was applied to approximate a symmetric distribution of the outcome. Effect size estimates and 95% confidence intervals (95%-CI) derived by linear regression models were presented back-transformed as exp[β] to interpret results on the original scale. Exp(beta) can be interpreted as the relative change in the geometric mean of CRP per unit increase in the independent variable. SNP rs4287174 was included in assuming a (log-)additive genetic model. To quantify the association of education and income with CRP, dummy variables were used with the highest SEP category as reference. To analyze the genetic effect of SNP rs4287174 on CRP in different SEP groups, age- and sex-adjusted linear regression models were fitted stratified by educational group and income quartile. The association between SEP indicators and CRP was also analyzed stratified by rs4287174 genotype. Age- and sex-adjusted linear regression models were fitted to assess strength of interaction between rs4287174 genotype and SEP indicators on CRP by including SEP group by rs4287174 interaction terms next to SEP and rs4287174 main effects in the model. Education and income were again analyzed separately and included as dummy variables with the highest category as reference. In addition, a single reference joint analysis was applied to assess the combined effect of SEP group and rs4287174 genotype on CRP. Dummy variables were created for all possible combinations of SEP group and rs4287174 genotype with the group of high SEP indicator and A allele CRP genotypes as reference. For this purpose, rs4287174 genotypes A/A and A/T were combined, as the group with high SEP and A/A genotype would have been too small to sufficiently serve as reference group with adequate statistical power.

SEP is likely to have no direct effect on CRP values, but its effect may be mediated by inflammatory risk factors associated with SEP. Potential gene by SEP interactions in the present study may thus be mediated by interactions of the CRP gene with risk factors affecting CRP values. Therefore, risk factors (i.e., body mass index (BMI), total cholesterol, diabetes mellitus, coronary artery calcification, current smoking, hypertension, diet, no exercise) were additionally included in the main interaction model as potential mediators (not confounders) of CRP gene by SEP interactions. According to Keller (2014)31,to properly adjust for covariates in a regression model including interaction terms it is necessary to not only include main effects but also covariate interaction terms. To avoid overfitting of regression models and to investigate, which one of the covariates may have a substantial impact on observed gene by SEP interactions, each covariate was included separately in the main interaction model. All analyses were performed using IBM ® SPSS® Statistics Version 27.

Results

Sex and the average age did not strongly differ between the genotype groups (Table 1). The median CRP value showed an average of 0.15 (IQR: 0.07–0.32) mg/dl in the analysis population, while higher values were observed with each additional rs4287174 T allele. The distribution of household income (€/month) and years of education did not differ across genotype groups.

Table 1 Characteristics of the analysis population (n = 4065), stratified by SNP rs4287174 genotype.

The main genetic effect of rs4287174 indicated a 1.045-fold (95%-CI: 1.033; 1.057) average CRP value per additional T allele (Table 2). The lowest education group showed on average 1.101-fold (95%-CI: 1.064; 1.139) CRP values, compared to the highest education group. Regarding income, 1.055-fold (95%-CI: 1.029; 1.081) average CRP values were observed in the lowest compared to the highest income quartile. For study participants in the intermediate SEP groups indication for higher average CRP values was also observed compared to the reference categories, leading to a negative trend in the magnitude of effect size estimates across SEP groups.

Table 2 Age- and sex-adjusted relative effect size estimates (exp(beta)) and 95% confidence intervals (95%-CI) for the association of SNP rs4287174 alleles, education groups (years) and income quartiles with CRP values assessed in separate linear regression models.

The rs4287174 genetic effect on CRP stratified by education showed the strongest effect size estimate for the lowest education group with an exp(ß) of 1.058 (95%-CI: 1.018; 1.100), while in the highest education group the genetic effect nearly disappeared (exp(ß) of 1.005 (95%-CI: 0.975; 1.037)) (Fig. 2). A decrease in strength of effect size could be observed with increasing education, although in the intermediate education groups the effect size estimates were similar in magnitude (Fig. 2). When stratified by income quartiles, the strongest genetic effect could be observed in the second quartile with an exp(ß) of 1.057 (95%-CI: 1.034; 1.082), but no strong differences in the effect size estimates for the rs4287174 genetic effect on CRP were observed across income quartiles (Fig. 2).

Fig. 2
figure 2

Age- and sex-adjusted relative genetic effects (exp(beta)) of SNP rs4287174 alleles on CRP values, stratified by education groups (years) and income quartiles in linear regression models; 95%-CI: 95% confidence interval.

When analyzing the association between education and CRP values stratified by rs4287174 genotype, the strongest effect size estimates were observed in the T/T genotype group showing an exp(ß) of 1.152 (95%-CI: 1.086;1.223) for the lowest compared with the highest education group (Suppl. Table 3).

The results in Table 3 indicated a positive interaction between rs4287174 alleles and low (≤ 10 years) as well as intermediate (11–13 years) education compared to the highest education group. The lower the educational level, the stronger were the effect size estimates for interaction by rs4287174 across educational groups. However, no interaction between effect alleles and income was found.

Table 3 Age- and sex- adjusted relative effect size estimates (exp(beta)) and corresponding 95% confidence intervals (95%-CI) for the interactions of rs4287174 alleles with education groups (years) and income quartiles on CRP values in separate linear regression models.

In the analysis of joint effects using a single reference group of participants within the highest education group and with A allele rs4287174 genotypes, effect size estimates showed a clear trend within and between education groups (Table 4). The lower the education group and the higher the number of CRP-increasing rs4287174 alleles the stronger the association with CRP. The strongest effect size estimate was observed for the group with the lowest education and two CRP-increasing rs4287174 alleles showing an exp(ß) of 1.156 (95%-CI: 1.102; 1.214) compared to the reference group.

Table 4 Age- and sex-adjusted single reference joint effects (exp(beta)) and corresponding 95% confidence intervals (95%-CI) for all possible combinations of education group and rs4287174 genotype on CRP values with the group of highest education and CRP genotypes A/A and A/T as reference.

For the risk factors separately included in the main interaction model the effect size estimates for the rs4287174 by education interaction terms differed only slightly. For none of the risk factors an interaction with rs4287174 was indicated (Table 5).

Table 5 Age- and sex- adjusted relative effect size estimates (exp(beta)) and corresponding 95% confidence intervals (95%-CI) for the interactions of rs4287174 alleles with education groups (years) on CRP values additionally including risk factor main effects and rs4287174 by risk factor interactions terms in separate linear regression models.

Discussion

The results of the present study gave indication for interaction between allelic variation in the CRP gene SNP rs4287174 and education on CRP values in a population-based study sample. This was supported by (1) genetic effects stratified by education group showing stronger effect size estimates in groups of lower education, (2) positive rs4287174 T allele by low education interaction in linear regression models including interaction terms and (3) single reference joint effects showing the highest CRP values in the group with the lowest education and two rs4287174 T alleles. The observed interaction did not seem to be mediated by risk factors associated with SEP.

The results for the association of SEP indicators with CRP values are in line with previous studies that established the link between low SEP and higher CRP values in adult populations14,32. There are many hypotheses on the underlying cause of the association between SEP and negative health outcomes in general: lower education might for example correlate with limited knowledge about health behaviors and health and low health literacy, but also adverse housing and working conditions, reduced material resources and increased physical strain11,28,29. Low SEP is associated adverse lifestyle factors (e.g., poor diet, smoking, low physical activity) leading to a higher prevalence of adverse factors (e.g., hypertension, high total cholesterol, high BMI) that increase the risk for a range of common complex diseases. Such risk factors could act as mediators and account for the association of SEP with CRP values: Kershaw et al.33 assumed health behavior such as smoking, diet and physical activity would explain the major proportion of the association between SEP and CRP. According to a previous study low-grade inflammation (possibly indicated by elevated CRP values) might be the connection between well-known cardiovascular risk factors and cardiovascular disease32. In addition, according to the psychosocial explanation model for health inequalities also adverse psychosocial factors that are more prevalent in low SEP groups may directly increase CRP levels via psychoneuroendocrine and psychoneuroimmunological pathways34. In accordance with the present study results inflammation might also be a mediator between SEP and cardiovascular disease32.

The SNP rs4287174 showed a strong main effect on CRP values in this study, which is in accordance with the meta-analysis of Ligthart et al.18. However, the genotype is not solely responsible for basal CRP upregulation. Main results of the present study indicated that education may have an impact on the expression of the CRP genotype effect on CRP values. While stronger genetic effects were observed within groups of lower education, it is unlikely that education has a direct effect on the expression of genetic effects. Inflammatory risk factors more prevalent in lower SEP groups may potentially mediate the observed interaction. Several previous studies have identified factors associated with SEP interacting with the CRP genotype. Eiriksdottir et al.35 showed interaction between another SNP variant (rs1205) and BMI on circulating CRP values. The authors hypothesized that the adipose tissue might be producing a factor that modulates the expression of the CRP gene. Another study investigated the interaction of three CRP SNPs with plasma fatty acids on the inflammatory profile and presented positive results for rs1417938 36. This was regarded as an indication for nutritional habits influencing the CRP gene expression, which is underlined by the results of Nienaber-Rousseau et al.37 showing interaction between several CRP SNPs and glucose fasting on inflammation. Yuan et al. (36)38 saw interaction of a CRP genetic risk score with copper, so occupational exposure might also play a role. All these factors – BMI, nutrition and occupational exposure – are known to be influenced by SEP11. They have also been discussed as potentially modifying epigenetic profiles, which could be hypothesized as one biological pathways for the observed interaction39,40. However, after including additional risk factors as potential mediators of the CRP gene by education interaction in the present analysis, the observed interaction did not seem to be mediated by these risk factors. As there was also no indication for interactions between the CRP gene and risk factors themselves, previous interactions reported for, e.g., BMI and diet, may have been confounded by SEP. In conclusion, there have to be other possible pathways through which SEP (as indicated by education in the present study) could affect the association of CRP genotype and circulating CRP levels.

In contrast to education, there was no indication for interaction between income and rs4287174 in this study. A possible explanation might be that the measurement of income is a snapshot which can alter due to changes in employment status (such as unemployment periods or retirement), while education is a constant characteristic usually completed early in life that reflects SEP in childhood, young adulthood and over early life course28. When Geyer et al.29 compared the SEP-indicators education, income and occupational position with regard to several health outcomes, differing results were observed related to the strength of associations. Education showed the strongest associations in the German study sample especially for diabetes mellitus and incident coronary events. They concluded that the investigated indicators reflect different dimensions of SEP with disparate effects on health outcomes. Income might represent dimensions of SEP which do not interact with CRP genotype, such as material resources and living conditions.

Besides the population-based study sample, the possibility to compare two different SEP indicators in the analyses is a strength of the present study. The sample size provided adequate statistical power for detecting the obtained genotype by education interaction effect size estimates, but was not sufficient for multiple testing of all CRP loci discovered to date. However, the evidence for interaction was not only based on estimating interaction terms but also on stratified and joint effect analyses. Results have to be replicated in additional study populations to check external validity. It would also be of interest, if interaction with SEP could be shown for other genetic loci associated with CRP. Additionally, due to the participants’ average age of about 60 years the impact of SEP on CRP gene expression in younger subjects remains unclear.

Despite these limitations, the current results gave supporting evidence for the interplay of SEP with CRP gene expression. We showed that a CRP genotype interacts with the SEP indicator education to influence individual CRP values in a population-based study sample and discussed possible pathways through which the influence might occur. While the effect of single common genetic variants on phenotypes such as CRP may be to small to be an intervention target or to be used for risk prediction in clinical routine, findings of this study highlight the strong impact of SEP even on genetically influenced aspects of health and underlines the importance of tackling health inequalities by reducing social inequalities.