Introduction

Ovarian cancer is highly lethal and the fifth most common cause of cancer-related death among women in high-income countries1,2. The overall incidence of ovarian cancer in women over 40 years of age has decreased significantly possibly due to the increased use of oral contraceptives3,4. However, the incidence of young-onset ovarian cancer remained stable in most developed countries4,5,6. In Asia, an increase in the incidence of ovarian cancer was observed, especially among the young population4,5,6,7.

Young women with ovarian cancer face unique challenges throughout the course of their treatment, such as a diagnostic delay, iatrogenic menopause, infertility, incontinence, and sexual dysfunction8,9,10,11,12,13. In addition, young patients with ovarian cancer are at high risk for financial toxicity caused by the disease and its treatments, with very few enrollments in clinical trials14,15. Although preventing young-onset ovarian cancer is crucial, limited data exists on its modifiable risk factors, except for obesity or nulliparity16,17.

Ample evidence suggests that obesity increases ovarian carcinogenesis through persistent inflammation and oxidative stress18,19,20,21. However, the effect of non-alcoholic fatty liver disease (NAFLD) on the risk of young-onset ovarian cancer remains unclear. NAFLD is a multisystemic disease associated with chronic inflammation and oxidative stress22,23,24. Moreover, the prevalence of NAFLD among young adults in developed countries is 20–24%, with a 2.5-fold increase over the last three decades25,26. To the best of our knowledge, only one small study suggested the association between NAFLD and the risk of ovarian cancer in middle-aged women (median age, 54 years; 19 cases of ovarian cancer cases in the NAFLD group)27.

Therefore, we conducted a nationwide population-based cohort study to investigate the longitudinal association between NAFLD and the subsequent risk of young-onset ovarian cancer. We followed up more than 2.3 million young women aged 20–39 years for ten years.

Patients and methods

Data source

We used data from the South Korean National Health Insurance Service (KNHIS) database. The KNHIS is a government-run universal national health insurance program covering approximately 97% of the population. The KNHIS database includes demographic information, medical treatments, procedures, diagnoses (based on the International Classification of Diseases, 10 th Revision, Clinical Modification [ICD-10-CM]), hospital utilization, and national health screening results. The KNHIS provides a regular National Health Screening Program every two years to all insured individuals aged ≥ 20 years or employees, regardless of age. Approximately 76% of the target population participates in this program28. This national health screening consists of anthropometric measures, laboratory testing, and a self-administered questionnaire regarding past medical history and lifestyle factors.

This study was approved by the Institutional Review Board of the Samsung Medical Center (IRB No. SMC2021-05–066) and the KNHIS Big Data Steering Department (NHIS-2021–1–711). The requirement for written informed consent was waived because the KNHIS dataset was compiled after anonymization in accordance with stringent confidentiality regulations. This study adhered to the criteria outlined in the Declaration of Helsinki.

Study population

The selection process for the study population is illustrated in Fig. 1. A total of 2,755,790 women aged 20–39 years who underwent a national health examination between January 1, 2009, and December 31, 2012 were included. To minimize the impact of pre-existing diseases and avoid bias, we excluded participants who had been diagnosed with any cancer except non-melanoma skin cancer before cohort entry (n = 11,971) and those who developed cancer except non-melanoma skin cancer or died within the first year of follow-up (n = 5,965). In addition, based on the definition of NAFLD, we excluded individuals with heavy alcohol consumption (≥ 20 g of alcohol per day; n = 62,269), liver cirrhosis, and hepatitis (ICD-10-CM codes K70.3 and B15–B19, respectively) (n = 141,500). We excluded participants with missing data (n = 157,603). Ultimately, 2,376,482 women were included in this study. The participants were followed up until December 31, 2022, the development of young-onset ovarian cancer, or death, whichever occurred first.

Fig. 1
figure 1

Study population selection flow chart.

Anthropometrics and laboratory measurements

Health professionals evaluated the following characteristics during the national health screening. Wearing lightweight clothing, the participants’ height, weight, and waist circumference were measured. Abdominal obesity was defined as a waist circumference ≥ 85 cm for females, according to Asian standards29. The body mass index (BMI) was calculated by dividing the weight (kg) by the square of the height (m2). According to Korean standards, obesity was defined as a BMI ≥ 25 kg/m22,30. Abdominal obesity was defined as waist circumference ≥ 85 cm for women, according to Asian standards29.

Systolic and diastolic blood pressures were recorded in a seated position after a minimum of five minutes of rest. After overnight fasting, blood samples were collected and examined for glucose, creatinine, total cholesterol, triglycerides, low-density lipoprotein cholesterol, high-density lipoprotein cholesterol, and γ-glutamyl transferase (GGT). The estimated glomerular filtration rate (eGFR) was calculated using the Modification of Diet in Renal Disease study Eq31.. Chronic kidney disease was defined as an eGFR < 60 mL/min/1.73 m2.

Definition of non-alcoholic fatty liver disease

According to the European Association for the Study of the Liver’s international guidelines, noninvasive biomarkers such as the fatty liver index are the preferred methods for assessing NAFLD in large-scale population-based studies32,33,34. In the young, asymptomatic general population, liver biopsies or imaging studies are not feasible. We therefore utilized one of the best-validated fatty liver prediction models: The fatty liver index32,33.

The fatty liver index has good predictive value for detecting fatty liver35,36. This index ranges from zero to 100 and is computed as follows: (e0.953×Ln (triglycerides) +0.139×BMI+0.718×Ln (GGT) +0.053×waist circumference±15.745)/(1 + e0.953×Ln (triglycerides) +0.139×BMI+0.718×Ln (GGT) +0.053×waist circumference±15.745) × 10037. The optimal cut-off value of the fatty liver index to detect ultrasonography-diagnosed fatty liver was previously validated at ≥ 30 with an area under the receiver operating characteristic curve of 0.82 (0.81–0.84) in the general population of Korea35,38. Participants were categorized into three groups based on their NAFLD status: none (fatty liver index < 30); moderate (30 ≤ fatty liver index < 60); and severe (fatty liver index ≥ 60)35,36,39.

Definition of young-onset ovarian cancer

The primary endpoint of this study was the occurrence of ovarian cancer diagnosed in women aged 20 to 49. Young-onset ovarian cancer was determined by the ICD-10-CM code for ovarian cancer (C56) during hospitalization and the special reimbursement code for cancer (V193). Since 2006, the KNHIS policy has reduced co-payments for cancer-related examinations and treatments by 5%. Cancer diagnoses must therefore be certified by physicians and medical institutions using a specific reimbursement code (V193). This code adds all patients with a confirmed cancer diagnosis to the KNHIS national registry.

Clinical variables

During the national health screening, the participants respond to a standard self-administered questionnaire regarding alcohol consumption, smoking status, and level of physical activity. First, average daily alcohol consumption was determined by calculating the frequency of alcohol consumption per week and amount of alcohol consumed per occasion. Participants were then categorized as non-drinkers, light-to-moderate drinkers (< 20 g of alcohol per day), or heavy drinkers (≥ 20 g of alcohol per day). Each participant was classified as a never smoker, former smoker, or current smoker, based on their smoking history. Regular physical activity was defined as ≥ 20 min of vigorous-intensity physical activity ≥ 3 times per week, or ≥ 30 min of moderate-intensity physical activity ≥ 5 times per week. Those in the lowest quintile of the required insurance fee or those who received free medical care were categorized as having low-income status.

Diabetes was defined as a fasting glucose level ≥ 126 mg/dL or a minimum of one claim per year for an antidiabetic medication prescription (oral and/or injectable antidiabetic medication) under ICD-10-CM codes E11–E14. Dyslipidemia was determined based on total cholesterol levels ≥ 240 mg/dL or prescriptions for lipid-lowering drugs under ICD-10-CM code E78. Hypertension was defined as a systolic blood pressure ≥ 140 mmHg, diastolic blood pressure ≥ 90 mmHg, or antihypertensive drug prescription under ICD-10-CM codes I10–I13 and I15. The ICD-10-CM codes were used to define pelvic inflammatory disease (N73.9), endometriosis (N80), tubal ligation (R4341-R4345), and polycystic ovary syndrome (E28.2).

Statistical analysis

Baseline characteristics were analyzed using analysis of variance for continuous variables and the chi-square test for categorical variables. The incidence rates of young-onset ovarian cancer were computed by dividing the number of incident cases by the total follow-up duration and are expressed per 100,000 person-years. The log-rank test was used to compare the differences between Kaplan–Meier curves depicting the cumulative incidence probability of young-onset ovarian cancers in women with and without NAFLD. Using multivariate Cox proportional hazards regression models, hazard ratios (HRs) and 95% confidence intervals (CIs) were calculated for the associations between NAFLD in young women and young-onset ovarian cancer. Model 1 was not adjusted. Model 2 was adjusted for age. Model 3 was adjusted for age, smoking status, alcohol consumption, physical activity, BMI, diabetes, pelvic inflammatory disease, tubal ligation, endometriosis, and polycystic ovary syndrome. In addition, we examined the risk of young-onset ovarian cancer according to NAFDL severity. The proportional hazards assumption was tested through Schoenfeld residual and log–log plot. The test result was considered to satisfy the proportional hazards assumption. We also performed a sensitivity analysis, defining the outcome as ovarian cancer diagnosed before the ages of 30 and 45 years, respectively. Study participants were censored at the time they reached 30 or 45 years of age during follow-up. The level of statistical significance was set at P < 0.05, using two-sided tests. All statistical analyses were performed using the SAS software (version 9.3; SAS Institute, Cary, NC, USA).

Results

Baseline characteristics of the study population

During the 26,786,501 person-years of follow-up, 6,319 young-onset ovarian cancer cases were diagnosed, and the total number of deaths was 9,361 (mean ± SD age, 29.8 ± 5.2 years). The median follow-up duration was 11.5 years (interquartile range 10.4–12.2).

Table 1 shows the baseline characteristics of the study population based on the presence or absence of young-onset ovarian cancer. Women with young-onset ovarian cancer were older and had higher BMIs (all P < 0.01) than those without the disease. Additionally, women with young-onset ovarian cancer were more likely to have diabetes, hypertension, dyslipidemia, obesity, and endometriosis (all P < 0.01).

Table 1 Baseline characteristics of the study population according to the young-onset ovarian cancer.

Association between non-alcoholic fatty liver disease and the risk of young-onset ovarian cancer

The Kaplan–Meier curve in Fig. 2 demonstrates that the cumulative incidence probability of young-onset ovarian cancer was consistently higher in women with NAFLD than in those without NAFLD during the follow-up period (log-rank P < 0.01). Table 2 also shows that the incidence of young-onset ovarian cancer was higher among women with NAFLD than among those without NAFLD (P < 0.01).

Fig. 2
figure 2

Kaplan–Meier curves of incidence probability of young-onset ovarian cancer according to non-alcoholic fatty liver disease status (log-rank P < 0.01).

Table 2 Association between non-alcoholic fatty liver disease and the risk for young-onset ovarian cancer.

As shown in Table 2, NAFLD was associated with an increased risk of young-onset ovarian cancer in all models (all P < 0.01). After adjusting for potential confounding variables including obesity (Model 3), NAFLD remained significantly associated with an increased risk of young-onset ovarian cancer (hazard ratio [HR], 95% CI: 1.30, 1.16–1.45). The risk of young-onset ovarian cancer tended to increase with increasing severity of NAFLD (P for trend < 0.05). Moderate NAFLD and severe NAFLD were associated with an increased risk of young-onset ovarian cancer (Model 3; HR, 1.26; 95% CI, 1.12–1.41 and HR, 1.45; 95% CI, 1.22–1.72, respectively).

Supplementary Table 1 shows the results of ending the follow-up at the age of 45 years. The HR for young-onset ovarian cancer occurring before age 45 was 1.32 (95% CI, 1.10–1.58) in women with NAFLD after adjusting for potential confounders (Model 3). Supplementary Table 2 presents the findings when follow-up was restricted to age 30 years. In this analysis, the HR for young-onset ovarian cancer diagnosed before 30 years was 1.21 (95% CI, 0.60–2.44) in women with NAFLD (Model 3).

Discussion

In this nationwide cohort study of more than 2.3 million women aged 20–39 years, we found that NAFLD in young women was independently associated with an increased risk of young-onset ovarian cancer. The risk of developing young-onset ovarian cancer tended to increase as the severity of NAFLD increased.

Previously, a cohort study including women with a median age of 54 years reported that individuals with NAFLD had a higher incidence rate ratio of ovarian cancer than age-matched controls27. The study identified 19 cases of ovarian cancer in the NAFLD group, which was defined by diagnostic codes. In addition, a meta-analysis of observational studies found positive association between NAFLD and gynecological cancers, including those of the ovary, uterus, and cervical region (pooled random effects HR, 1.62; 95% CI, 1.13–2.32)40.

We provide new evidence that NAFLD in young women may be a modifiable risk factor for young-onset ovarian cancer, independent of obesity. Increasing physical activity, reducing fructose and cholesterol intake, and losing weight are recommended as management strategies for NAFLD41,42,43,44. Our findings present a significant potential opportunity to lower the risk of young-onset ovarian cancer, as there are few known risk factors in this age group.

The following mechanisms may explain the association between NAFLD and the risk of young-onset ovarian cancer. Individuals with NAFLD are more likely to have chronic low-grade inflammation, which may promote ovarian cancer development22,45. The hepatic production and systemic release of multiple proinflammatory cytokines and pro-oxidative mediators may promote ovarian carcinogenesis in NAFLD by promoting cell proliferation, anti-apoptosis, angiogenesis, and the production of free radicals that damage DNA and facilitate tumor initiation and development22,46. Moreover, individuals with chronic liver diseases, including NAFLD, have high estrone and estradiol levels, which is attributed to the increased peripheral conversion of androgens to estrogen in both males and females47,48. Ample evidence suggest that estrogen is associated with an increased risk of ovarian cancer49,50,51. Thus, the increased estrogen associated with NAFLD may be involved in ovarian carcinogenesis. In addition, NAFLD-associated microbiome alterations may be associated with ovarian carcinogenesis through altered host immunologic response and modulation of cell proliferation52,53,54,55.

Our study has several strengths. First, this nationwide cohort study was based on data from a large sample size of over 2.3 million young women, with a median follow-up duration of 11.5 years. Hence, we identified NAFLD as a modifiable risk factor for young-onset ovarian cancer, which has a significant disease burden but a relatively low incidence. Second, anthropometric measurements, blood test results, lifestyle factors, and extensive medical records from a nationwide database were used. After cohort entry, the clinical course was reliably monitored using the KNHIS database. Third, analyses were performed after controlling for several potential confounding variables including BMI, alcohol consumption, smoking status, physical activity, diabetes, and pelvic inflammatory disease. Fourth, we used both ICD-10-CM diagnostic codes (C codes) and national registration codes (V codes) to identify patients with young-onset ovarian cancer.

This study also has several limitations. First, we used the previously validated fatty liver index to evaluate NAFLD, and no liver biopsies or imaging data were included. Given the impracticality of performing liver biopsies or imaging tests on over 2.3 million young women, the fatty liver index is a useful diagnostic biomarker for NAFLD. Second, although the national health screening is free, it is voluntary, with a participation rate of approximately 76%. Individuals participating in national health screening are likely to represent a generally healthy population, as those with underlying symptoms or diseases are more likely to require hospital testing than a national health screening. Third, we could not include information on the participants’ familial cancer history, parity, hormonal contraceptive use, or the stage or histological subtype of ovarian cancer due to the lack of data in the KNHIS. Epithelial tumors account for over 90% of ovarian cancer in women over 30 years, while germ cell tumors predominate between 20 and 30 years15. Fourth, the Korean NHIS database does not have information on cases of loss of follow-up due to immigration. However, the number of immigrants every year is less than 0.001% of all Koreans. Fifth, self-reported information on alcohol consumption, smoking status, and physical activity levels may have been subject to recall bias.

In conclusion, this nationwide cohort study of more than 2.3 million women aged 20–39 years revealed that NAFLD in young women was significantly associated with an increased risk of young-onset ovarian cancer. Our findings suggest that NAFLD may be a modifiable risk factor for young-onset ovarian cancer. Additional research in other study populations is warranted to reduce premature morbidity and mortality associated with young-onset ovarian cancer among the next generation. Further investigation is also required to elucidate the precise mechanism by which NAFLD contributes to the carcinogenesis of young-onset ovarian cancer.