Introduction

Pleural effusion is the abnormal accumulation of fluid within the pleural space. It can be broadly categorized as malignant pleural effusion (MPE) and benign (BPE)1,2. MPE is defined as pleural effusion owing to the metastasis of malignancies into the pleural space or primary pleural tumours. Lung adenocarcinoma is the most common tumour that causes MPE, followed by breast and ovarian cancers1,3. The median survival period of MPE patients is less than one year4,5. MPE must be carefully diagnosed because a misdiagnosis can cause a heavy financial burden as well as psychological distress for patients. The reference standards for diagnosing MPE are cytological analysis, pleural biopsy, and thoracoscopy6,7,8,9. However, these diagnostic tools have certain limitations. Cytological analysis has a specificity of 1.00; however, approximately half of MPE patients may be missed10,11. Thoracoscopy or pleural biopsy can also confirm MPE; however, the invasive nature of this approach limits its clinical application in certain patients (e.g., those with coagulation disorders)12. In addition, the diagnostic accuracy of cytological analysis, pleural biopsy, and thoracoscopy may vary among diagnosticians.

Pleural fluid tumour markers are potential tools for diagnosing MPE owing to their advantages of easy access, low invasiveness, rapidity, and objectiveness2. Our previous study revealed that pleural fluid human epididymal protein 4 (HE4) can assist in diagnosing MPE, with an AUC of 0.7513. However, the factors that can affect its diagnostic accuracy remain unknown. Notably, previous studies reported that serum HE4 levels are affected by age14,15 and sex15. Based on the aforementioned evidence, we hypothesize that age and sex confound the diagnostic value of HE4.

In this post hoc study, we investigated the impacts of sex and age on the diagnostic value of HE4. We prepared this work following the reporting guidelines of diagnostic test accuracy studies16.

Methods

Participants

This was a post hoc study in which the participants were enrolled in our previous study13. The participant recruitment inclusion and exclusion criteria were reported in our earlier studies17,18. Briefly, we recruited patients with pleural effusions with undetermined causes who were admitted to Affiliated Changshu Hospital of Nantong University (June 2020 to July 2021; Changshu cohort) and the Affiliated Hospital of Inner Mongolia Medical University (September 2018 to July 2021; Hohhot cohort). The inclusion criteria were as follows: (i) pleural effusion patients whose underlying aetiology was unknown and (ii) patients who received thoracentesis. Ultrasound, chest X-ray, or CT were used to confirm the presence of pleural effusion. The following participants were excluded: (i) patients who had a pleural effusion history during the last three months and whose aetiology was clear; (ii) patients under 18 years old; (iii) pregnant women; (iv) patients without sufficient effusion for research purposes; (v) patients whose pleural effusion developed during hospitalization; and (vi) patients whose pleural effusion was caused by trauma or operation.

The Ethics Committees of the Affiliated Changshu Hospital of Nantong University (No. 2020-KY-009) and the Affiliated Hospital of Inner Mongolia Medical University (No. 2018011) approved this study. All the participants signed an informed consent. We conducted the study in accordance with the Declaration of Helsinki.

Diagnostic criteria

The reference standards for differentiating pleural effusion have been described previously13. Briefly, the reference standards for MPE include a cytological analysis and pleural histology. In patients with negative cytology results who were unwilling or unable to undergo pleural biopsy or thoracoscopy, MPE was defined as evidence of a primary tumour, clinical picture, and exclusions of BPE. Tuberculous pleural effusion (TPE) was diagnosed by pleural fluid Ziehl-Neelsen staining, Mycobacterium tuberculosis (Mtb) culture, or the presence of granulomas confirmed by pleural biopsy. In patients with no microbiologic evidence and who were unwilling or unable to undergo pleural biopsy, TPE was defined based on the antituberculosis treatment response and the exclusion of other pleural effusions. Parapneumonic pleural effusion (PPE) was defined based on the history, presence of pus cells in the effusion, bacterial culture, chest imaging characteristics, pleural biopsy (with evidence of neutrophilic infiltration), and the response to antibiotic therapy19. Congestive heart failure (CHF)-induced pleural effusion was diagnosed based on Light’s criteria, ultrasound and chest imaging characteristics (bilateral pleural effusion), serum chemistry (e.g., natriuretic peptides), echocardiographic features (e.g., enlarged cardiac chambers, decreased left ventricular ejection fraction), and treatment response to diuretics. Two senior clinicians (Zhi-De Hu and Li Yan) interpreted the medical records of participants and independently made the final diagnosis; disagreements were resolved by consensus. The clinicians who diagnosed all the patients were unaware of the levels of HE4 in the pleural fluid.

Pleural fluid HE4 assay

After admission, a 10-mL effusion sample was collected into a tube without any anticoagulant. After centrifugation, the supernatant was collected, aliquoted, and stored between − 80 °C and − 70 °C. The ARCHITECT i2000SR analyzer was used to measure HE4 levels in both cohorts. According to the internal quality control, the intra-assay coefficient of variation (CV) was 4.65% at 148 pmol/L. The upper limit of HE4 detection was 1500 pmol/L. The laboratory technicians who performed the HE4 analysis were blinded to the clinical details of the patients.

Statistical analysis

We employed a receiver operating characteristic (ROC) curve to evaluate the accuracy of HE4 in diagnosing MPE. The area under the curve (AUC) was used to measure the global diagnostic accuracy of HE4. Two methods were used to test the effects of age and sex on the accuracy of HE4. First, participants were categorized into subgroups according to their sex (male and female) and age (< 55 and ≥ 55 years old), and the AUCs of HE4 in each group were calculated. A threshold of 55 years of age was selected because a previous study revealed that HE4 increased with age, particularly among women ≥ 55 years20. We used Delong’s method to compare the AUCs21. Second, we resampled the participants with different upper age limits and calculated the AUC of HE4 in each resampled dataset. The principles of this statistical method have been described in our previous studies22,23,24. All the statistics and graphs were produced with R (version 4.3.2). A p-value of less than 0.05 was considered to indicate statistical significance.

Results

Characteristics of the participants

The Hohhot cohort comprised 66 patients with MPE and 86 patients with BPE, whereas the Changshu cohort comprised 26 patients with MPE and 32 patients with BPE. The participant selection process and essential characteristics of the participants in both cohorts were described in a previous study13.

Factors affecting the accuracy of HE4 for diagnosing MPE

Figure 1A shows the results of the subgroup analyses. We failed to observe a significant difference in the AUC of HE4 according to sex (p > 0.05 for all). However, the AUCs of HE4 in younger patients (< 55 years old) were greater than those in older patients (≥ 55 years old) in the Hohhot cohort (p = 0.02) and the combined cohort (< 0.001) (Fig. 1B).

Fig. 1
figure 1

AUCs of HE4 in diagnosing MPE in different subgroups. The bar represents the 95%CI of the AUC.

In the next step, the participants were resampled using different upper age limits, and the AUCs of HE4 were calculated for each resampled dataset. Figure 2 shows the relationship between the AUCs and the upper age limits in each resampled dataset. The AUCs of HE4 decreased as the upper age limit increased in both cohorts.

Fig. 2
figure 2

Relationships between the AUCs of HE4 and the upper age limits for each resampled dataset.

Discussion

Our previous study revealed that HE4 has a moderate diagnostic accuracy for MPE13; however, the factors that can affect the accuracy of HE4 remain unclear. With two independent cohorts, this study revealed that sex did not affect the accuracy of HE4. However, we found that the accuracy of HE4 decreased as age increased. Therefore, we conclude that age should be considered when interpreting the accuracy of pleural HE4 for detecting MPE. To our knowledge, no study has analysed the effects of age and sex on the accuracy of pleural HE4 for diagnosing MPE.

This study used two methods to explore how age and sex affect the accuracy of HE4 diagnosis: subgroup analyses and resampling of participants with different upper age limits. Subgroup analyses are commonly used to investigate how certain factors impact the accuracy of a diagnostic test. However, these methods have limitations when analysing the accuracy of continuous data. The thresholds used to define subgroups can theoretically affect the results. One strength of this study is that we resampled the participants using different upper age limits and calculated the AUCs of each dataset for comparison. This method considers nearly all the possible thresholds and thus provides more theoretically reliable results than subgroup analyses. Both analytical methods revealed that older patients had a decreased AUC, suggesting that age negatively affects the accuracy of HE4. Another strength of this study was that we used two cohorts to assess the effects of age on the accuracy of HE4, making our findings reliable.

Other studies have reported that age can influence serum HE4 levels15,25,26. Our study revealed that HE4 has an AUC of approximately 0.75, whereas previous studies reported that the AUC of HE4 was > 0.80 for diagnosing MPE27,28,29. This inconsistency may be attributed to the median age of the participants in our study being > 70 years. Conversely, the average ages in previous studies ranged from 60 to 65 years. These findings also support our conclusion that HE4 has a low diagnostic accuracy for MPE in older patients. The adverse effects of age on the accuracy of HE4 are biologically plausible. Serum HE4 is affected by various factors, such as heart failure30, diabetes31, hypertension31, and renal impairment25. The prevalence of these disorders increases in older patients, thus impairing the diagnostic accuracy of HE4. Although sex does have an impact on serum HE4 concentrations in healthy individuals, the effects are minor15,32. Various disorders, such as cancer and inflammation, can significantly increase HE4 levels in body fluids33. Consequently, the complex pathogenesis of pleural effusion may outweigh the influence of sex on pleural HE4 levels.

Despite being the first study to investigate the confounding factors affecting the diagnostic value of HE4 for MPE, there were certain limitations. The small sample size is the first limitation, which may impair the precision of our results. Although there were two cohorts in our study and the findings were reproduced in both cohorts, the sample size of the Changshu cohort was small. Therefore, external cohorts are needed to validate our findings. The second limitation is that > 90% of the participants in our study had a normal renal function, hindering the study of renal-function impact. The third limitation is that the stored pleural fluid specimens were used to measure HE4 levels; however, the long-term stability of pleural HE4 remains unknown.

In summary, the diagnostic accuracy of pleural HE4 for MPE decreased in older patients, whereas sex had no significant impact. Therefore, age should be considered when using pleural fluid HE4 levels to diagnose MPE. Owing to the limitations of this study, further research with larger sample sizes is necessary to validate our findings.