Introduction

Interstitial lung diseases (ILDs) are a group of heterogeneous lung diseases with pulmonary alveolar unit inflammation or interstitial fibrosis that are associated with substantial morbidity and mortality. The causation of ILD includes idiopathic and specific etiology including autoimmune disease, vasculitis, drugs, tumors and occupational or environmental exposure1,2. Idiopathic pulmonary fibrosis (IPF) is one of a family of idiopathic interstitial pneumonias characterized by usual interstitial pneumonia (UIP) in high-resolution computed tomography (HRCT) and pathology3. IPF has a poor prognosis with median survival from the time of diagnosis approximately 3 years4. Fibrosing ILDs other than IPF, such as connective tissue disease (CTD) associated ILD, including ILD associated with rheumatoid arthritis (RA-ILD), systemic sclerosis (SSc-ILD) and polymyositis/dermatomyositis are also known to have progressive disease behaviors similar to IPF5,6,7. Therefore, it is important to recognize the risk factors associated with poor prognosis in patients with fibrosing interstitial lung disease.

Several biomakers has been reported to be as diagnostic and prognostic biomarkers of fibrosing ILD, including Krebs von den lugen-6 (KL-6), Surfactant proteins A and D (SP-A and SP-D), serum interleukin 6 (IL-6) levels and tumor markers8,9,10. However, the relationship between proportion of each serum marker and fibrosing ILD is not clear. Therefore, in this study, we retrospectively studied the clinical characteristics of patients with fibrosing ILD and established a novel model to better guide personalized therapeutic choices in persons.

Materials and methods

Study subjects

We retrospectively reviewed 647 patients who were diagnosed of fibrosing ILD (IPF and CTD-associated UIP) from inpatient of the department of respiration of Nanjing Drum Tower Hospital from February 2017 to February 2020. Overall, 308 patients were excluded based on exclusion criteria. A total of 339 patients were analyzed as a derivation cohort(Fig. 1A). To validate the value of clinical scoring system to predict the survival of patients with fibrosing ILD, a validation cohort was performed which consisted of 98 patients with fibrosing ILD who were admitted to the department of respiration of Nanjing Drum Tower Hospital between February 2020 and February 2021(Fig. 1B). Patients with incomplete data were excluded. Exclusion criteria for all fibrosing ILD subjects were: (1) subjects had combined pneumonia, lung malignancy, or other pulmonary diseases; (2) subjects lacked of pulmonary function test results; (3) subjects of validation cohort overlapped with derivation cohort. We analyzed demographic features, clinical characteristics, lung function parameters and therapy. Survival status was determined by reviewing the medical records or telephone follow-ups until February 2021.

Figure 1
figure 1

Flow diagram describing the selection of the study population.

This study was consented by Ethics Committee of Nanjing Drum Tower Hospital. The Ethics Committee waived the need for informed consent as the study was retrospective and the data were analyzed anonymously.

Methods

The diagnosis for IPF was mainly based on the criteria from An Official ATS/ERS/JRS/ALAT Clinical Practice Guideline3. The diagnosis of CTD-ILD referred to the published guideline11. Clinical information at admission was collected including demographics, smoking history. Pulmonary function tests including forced vital capacity (FVC), FVC% predicted, diffusion capacity for carbon monoxide (DLCO), and DLCO% predicted were extracted for analysis. All subjects had UIP pattern on chest HRCT as defined by the guidelines from the American thoracic society and the European respiratory society3,11.

Statistical analysis

Data were expressed as mean ± standard deviation (SD). t-Test or the Mann–Whitney U test was used for continuous variables. Categorical variables were compared by Chi-square test. The independent prognostic role of variables were evaluated by Cox proportional hazard analysis. Receiver operator characteristic (ROC) analyses were performed to calculate area under the ROC curve (AUC) of markers for predicting the prognosis of fibrosing ILD. The Kaplan–Meier method was used to assess survival curves with GraphPad Prism version 7 (Graph Pad Software Inc., La Jolla, CA, USA). The log-rank test was used to evaluate the statistical significance of differences between the higher ACDS score and lower ACDS score groups. Statistical analyses were performed using SPSS18.0 statistical software. Statistical significance was considered at the 0.05 levels.

Ethics approval and consent to participate

The study was conducted according to the Declaration of Helsinki. This study was approved by Ethics Committee of Nanjing Drum Tower Hospital, the Affiliated Hospital of Nanjing University Medical School.The Ethics Committee of Nanjing Drum Tower Hospital, the Affiliated Hospital of Nanjing University Medical School waived the need for informed consent as the study was retrospective and the data were analyzed anonymously.

Results

Baseline clinical characteristics of fibrosing ILD patients

The baseline clinical features of subjects with IPF (n  =  132) and CTD-ILD characterized by UIP on HRCT (n  =  207) were summarized in Table 1. Male gender, older age were more common in the IPF group (p < 0.001 and p < 0.001, respectively). Smoking history was similar. Red blood cell distribution width(RDW) levels, serum total bilirubin (TBIL) and direct bilirubin (DBIL) also differed between the two groups (p  =  0.008, p < 0.001 and p  =  0.001, respectively). Patients with CTD-ILD characterized by UIP on HRCT had a higher DLCO% predicted level compared with the IPF patients.

Table 1 Baseline clinical features in the derivation cohort.

Constructing a scoring system for predicting prognosis of fibrosing ILD

According to the final follow-up data, 339 fibrosing ILD patients were divided into survivors group (n = 259) and decedents group (n = 80). As was shown in Table 2, there was no difference in the proportion of CTD-UIP and IPF among the survivors group and decedents group (p = 0.072). Cox proportional hazards models were used to examine the influence of variables on the prognosis of patients with fibrosing ILD. The multivariate cox regression analysis showed that smoking history (HR = 3.826, p = 0.001), age (HR = 1.043, p = 0.015), carcinoem-bryonic antigen (CEA) (HR = 1.059, p = 0.049), cytokeratin 21–1(CYFRA21-1) (HR = 1.177, p = 0.004) and DLCO%predicted (HR = 0.979, p = 0.032) were independent prognostic factors for fibrosing ILD (Table 3).

Table 2 Comparison between survivors and decedents in fibrosing ILD patients of the derivation cohort.
Table 3 Prognostic factors for survival by univariate and multivariate Cox regression models in fibrosing ILD patients of the derivation cohort.

The accuracy of independent prognostic factors for predicting the survival of fibrosing ILD was then evaluated by Receiver Operating Characteristics (ROC) analysis. The area under the ROC curve for CYFRA21-1 in predicting the survival of fibrosing ILD was 0.85 (95% CI, 0.80–0.90; p < 0.001). The prediction ability for smoking history, age, CEA and DLCO%predicted were listed in Table 4. Then, we constructed a simple clinical scoring system for predicting survival of fibrosing ILD with the variables of smoking history, age, CEA, CYFRA21-1 and DLCO%predicted (Table 5).

Table 4 Comparisons of ROC curve analysis for predicting the survival of fibrosing ILD patients.
Table 5 Development of a clinical scoring system to predict survival of interstitial pneumonia characterized by UIP in HRCT.

Association of clinical scoring system with survival of patients with fibrosing ILD in the validation cohort

ROC curve was calculated to compare the predictive value of the scoring system in the derivation cohort. The ROC curve was shown in Fig. 2. The area under the curve of the scoring system for predicting survival of fibrosing ILD was 0.90 (95%CI: 0.87–0.94, P < 0.001). The cutoff value was 2.5 with their corresponding specificity (90.7%) and sensitivity (78.8%). In the validation cohort, the patients were divided into a higher ACDS score group (n = 42, ACDS score ≥ 2.5) and a lower ACDS score group (n = 56, ACDS score < 2.5) to analyze the survival using the Kaplan–Meier method (Fig. 3). The log-rank test showed a significant difference in survival between the two groups (p < 0.001).

Figure 2
figure 2

ROC curve of the scoring system for predicting survival of fibrosing ILD in the derivation cohort.

Figure 3
figure 3

The respective Kaplan–Meier curve of fibrosing ILD patients with lower ACDS score group and higher ACDS score group in the validation cohort.

Discussion

The present study retrospectively compared the clinical and follow-up data between 259 survivors and 80 decedents with fibrosing ILD in the derivation cohort. In this study, we demonstrated that smoking history, age, CEA, CYFRA21-1 and DLCO% predicted could predict the survival of fibrosing ILD patients independently. A new predictive scoring system namely ACDS (age [A], CEA and CYFRA21-1 [C], DLCO% predicted [D], and smoking history [S]) was proposed. Importantly, we found that scoring system level was closely associated with the prognosis of fibrosing ILD patients. Furthermore, we demonstrated that patients with relatively low ACDS score had significantly longer overall survival than patients with relatively high ACDS score in the validation cohort.

Fibrosing ILD had similar biological and clinical behaviours which was characterised by progressive deterioration in lung function, progressive deterioration in lung function and high mortality rate12,13. Investigating the prognostic value of markers across fibrosing ILD was of great importance to clinical evaluation and make continues to elucidate the approach to fibrosing ILD management. In the past few years, several serum markers were identified as simple and readily accessible biomarkers to predict the survival and severity of fibrosing ILD. There were researches studying tumor markers such as CEA , carbohydrate antigen 19–9 (CA 19–9) and CYFRA21-1 that might reflect the severity and prognosis of fibrosing ILD14,15,16. One retrospective study by Fahim A et al., which included 41 non-smoking patients with idiopathic pulmonary fibrosis(IPF), reported that serum CEA concentration was elevated in approximately half of patients with IPF and was correlated with disease severity17. These results were consistent with the finding of our study of CEA as a biomarker in fibrosing ILD patients. In our study, CEA was identified as an independent prognostic factor for fibrosing ILD. CEA is a glycoprotein involved in cell adhesion and is produced by colonic epithelium. It has reported that CEA localizes in metaplastic epithelium lining honeycombed bronchioles by immunohistochemical staining. As cuboidal pneumocytes are the predominant source of epithelial renewal in severe lung damage and fibrosis, these cells are the most likely source of CEA release18.

In this study, elevated serum levels of CYFRA21-1 were observed in decedents group with fibrosing ILD. In a study by Vercauteren et al., higher level of CYFRA 21–1 in BAL of IPF patients resulted in worse survival in comparison with the CYFRA 21–1 low counterpart19. The expression of CYFRA21-1 in the lung has been identified in bronchiolar epithelial cells and pneumocytes. Elevation of serum CYFRA21-1 concentration might be associated with lysis or regeneration of these cells15. Furthermore, we demonstrated that serum CEA and CYFRA21-1 were significantly correlated with decreased DLCO%predicted in this study. The severity of ILD is usually based on pulmonary function test results such as DLCO%predicted20. Thus, serum CEA and CYFRA21-1 levels might be useful for reflecting the severity of fibrosing ILD.

A large amount of studies reported that smoking was closely associated with the onset and progress of pulmonary fibrosis21,22. A possible explanation may be that cigarettes contain the cytotoxic, mutagenic and proinflammatory substances. According to previous reports, these substances caused cellular oxidative stress, increased epithelial cell apoptosis, and dysregulation of immune responses, which was responsible for the progress of pulmonary fibrosis23,24. In addition, smoking affects the function of macrophages. It induced macrophage polarization to M2 phenotype that enhance the regression of inflammation and tissue remodeling25. Therefore, smoking cessation could be a good way to slow down the development of pulmonary fibrosis in the patients with ILD.

In the past years, few models has been proposed to predict the severity and prognosis of IPF. Glasgow prognostic score (GPS) has been reported to play an important role in predicting mortality in patients with acute exacerbation of IPF26. In our study, smoking history, age, CEA, CYFRA21-1 and DLCO% predicted were identified as independent factors for predicting the prognosis of fibrosing ILD. Moreover, based on these variables, a new predictive scoring system namely ACDS (age [A], CEA and CYFRA21-1 [C], DLCO% predicted [D], and smoking history [S]) was proposed. The scoring system was demonstrated to be as a predictive value for the survival of fibrosing ILD. However, it still needs further perspective study to verify the power of this scoring system based on multicenter and large population of fibrosing ILD patients.

Some limitations of this study should be noted. First, this was a retrospective and observational study of data obtained from a single center. In addition, the mechanism underlying the association of each biomarker with fibrosing ILD remains to be clarified in further in vivo and in vitro studies.

Conclusions

In conclusion, smoking history, age, CEA, CYFRA21-1 and DLCO% predicted were independent predictors of the prognosis of fibrosing ILD patients that offers the advantages of convenience, ease of accessibility and low cost. A new predictive scoring system namely ACDS may help predict prognosis in patients with fibrosing ILD.