Introduction

Drug-induced interstitial lung disease (DILD) is a group of diffuse parenchymal lung disorders. DILD is caused by inflammation of the lung interstitium following exposure to over 380 known drugs, including cancer chemotherapy agents (e.g., paclitaxel and gemcitabine), amiodarone, and monoclonal antibody therapies (e.g., nivolumab and pembrolizumab)1,2,3,4. Although DILD exhibits numerous patterns in histological diagnosis, typically, diffuse alveolar damage (DAD), organizing pneumonia (OP), non-specific interstitial pneumonia (NSIP), eosinophilic pneumonia, and hypersensitivity pneumonia are observed5,6. These histopathological patterns as well as clinical phenotypes and computed tomography (CT) images of DILD vary significantly, even between patients receiving the same drug. As DILD is a serious adverse drug reaction that poses problems in drug development and clinical practice, new approaches are urgently required to specifically diagnose DILD. In addition, the variability of DILD properties makes distinguishing DILD from other related lung diseases complicated, including interstitial pneumonias caused by other factors4,5,6,7. Thus, along with DILD diagnosis, approaches to distinguish DILD from other lung diseases, especially idiopathic interstitial pneumonia (IIP), are also required.

DILD is traditionally diagnosed through comprehensive clinical evaluations, including laboratory tests for basic blood parameters and known lung biomarkers, chest radiography and/or high-resolution CT (HRCT), pulmonary function testing, and, if necessary, invasive procedures such as bronchoscopy4,5,6,7,8. Since biomarker tests have particular advantages in terms of low cost, minimal invasiveness, and ease of handling by general physicians, they are playing an increasingly important role in clinical drug use and development. The known lung biomarkers used to assist the diagnosis of DILD are surface protein-D (SP-D) and Krebs von den Lungen-6 (KL-6), which are glycoproteins produced by type II pneumocytes9,10,11. A prospective study revealed that KL-6 levels are increased in 53% of patients with DILD, which correlates with DAD and extensive lung involvement12. Meanwhile, it has been reported that SP-D can be used to discriminate between everolimus-treated patients with (n = 12) and without (n = 13) DILD13. However, these known biomarkers have also been shown to be changed by other lung diseases, such as lung cancer (LuCa), IIP, and lung disease associated with connective tissue disease (CTD)12,14,15,16. Since DILD can be caused by a broad spectrum of drugs and the underlying lung diseases vary between DILD patients, more specific biomarkers are required to facilitate the diagnosis of DILD and select appropriate treatments in a timely manner.

Technical advances in omics approaches have allowed the comprehensive analysis of circulating molecules, including RNA, proteins, and metabolites, to improve the discovery of candidate biomarkers. Lipidomics is a relatively recent omics approach that targets lipids17,18,19, a major class of metabolites that constitute the structural components of cells, organelles, and vesicle membranes, and act as a source of energy and/or cell signaling molecules. Accordingly, lipids are considered suitable targets for biomarker development. In this study, we used lipidomics analysis to obtain novel biomarkers for DILD, from plasma samples obtained from patients with acute and recovery phase DILD. Having identified lysophosphatidylcholines (LPCs) as candidate biomarkers for DILD, we validated these biomarkers in DILD-tolerant patients prescribed with DILD-causing drugs for at least 12 weeks. In addition, we verified the association between LPC(14:0), the most promising biomarker candidate for DILD, and the clinical characteristics of patients with DILD, as well as their ability to discriminate between DILD and other lung diseases.

Materials and methods

Subjects

All patients were recruited from Shinshu University, Nippon Medical School, Chiba University, and Hiroshima University. Healthy volunteers were recruited from the Yaesu Sakura-Dori Clinic. DILD was diagnosed by respiratory specialists according to the following Japanese diagnostic criteria6 that ameliorated the diagnostic criteria reported by Camus et al.5: 1) History of ingestion of a drug known to induce lung injury; 2) the clinical manifestations have been reported to be induced by a drug; 3) other causes of the clinical manifestations could be ruled out; and 4) improvement of the clinical manifestations after drug discontinuation.

Based on HRCT findings, DILD was classified into four patterns: DAD, NSIP, OP, and others, by respiratory specialists at each hospital. Some patients displayed multiple patterns.

DILD recovery was determined at least two weeks after the onset of DILD by respiratory specialists at each hospital, based on the improvement of clinical symptoms, lung imaging findings (e.g., HRCT), and oxygenation status (e.g., SpO2). Patients who were prescribed DILD-causing drugs without DILD for at least 12 weeks were enrolled in the DILD-tolerant group. Differences in general and background diseases and clinical characteristics among DILD groups were evaluated using Student’s t-test for numerical factors and the chi-square test for categorical factors. Patients with LuCa, bacterial pneumonia (BaPn), nontuberculous mycobacteriosis (NoMy), IIP, CTD, chronic obstructive pulmonary disease (COPD), or bronchial asthma (BrAs) were also diagnosed by respiratory specialists.

Sample collection

Blood samples were collected into vacuum blood collection tubes containing 7 mL EDTA-2Na by venipuncture (Venoject II, Terumo, Tokyo, Japan) and were immediately centrifuged at 2500 × g and 4 °C for 10 min. Plasma was dispensed into screw-capped polypropylene tubes and stored at − 80 °C within 2–4 h of drawing blood.

Lipidomics and candidate biomarker discovery

Lipidomics was performed as described previously20. The processed lipid level data are presented in Supplementary information 2. Lipids confirmed by fragment ions were used for data analysis. Significant differences in lipid levels were assessed using the false discovery rate (FDR)-adjusted Welch’s t-test, and candidate biomarkers were identified using the effect size (Hedge’s g).

LPC validated liquid chromatography/mass spectrometry (LC/MS) biomarker assay and biomarker validation and characterization

A validated LC/MS biomarker assay was performed on the LPCs as described previously21, with some modifications. The modified details and validated assay parameters and performance are described in Supplementary information 1 and Supplementary information 3. The determined LPC concentrations and associated clinical characteristics of patients with and without DILD are presented in Supplementary information 4.

Statistical analysis

LPC concentrations were compared between two or more groups using Student’s t-tests or ANOVA with Tukey’s post-hoc test in GraphPad Prism 9 (GraphPad Software, San Diego, CA, USA). The discrimination ability of different LPCs was assessed using the area under the curve (AUC) of receiver operating characteristic (ROC) analysis using GraphPad Prism 9. The correlation between LPC concentration and severity of symptoms in DILD was calculated using Pearson’s correlation coefficient. Multivariable logistic regression analysis of LPC concentration and associated clinical characteristics was performed using R 3.6.0 software (R Foundation for Statistical Computing, Vienna, Austria).

Ethics approval

This study was conducted in accordance with the Declaration of Helsinki and was approved by the Ethics Committee of the National Institute of Health Science (NIHS) (257 and 259 for NIHS; 261 and 263 for Kihara Memorial Foundation), Shinshu University (3318 and 4716), Nippon Medical School (27-11-514), Chiba University (2265), Hiroshima University (E-245), Daiichi Sankyo Co., Ltd. (15-0504-00), and Astellas Pharma Inc. (000043).

Consent to participate

Written informed consent was obtained from all study participants.

Results

Lipidomics screening of candidate biomarkers for DILD

First, we screened candidate biomarkers in 43 patients with DILD (43 acute phase samples; 30 recovery phase samples), whose clinical characteristics are summarized in Supplementary information 5. The screening set included 21 patients (21 acute phase; 17 recovery phase), while the validation set included 22 patients (22 acute phase; 13 recovery phase). Lipidomic analysis of the plasma samples identified 396 lipids (Supplementary information 2) that were quantified as arbitrary lipid units normalized to an internal standard (phosphatidylcholine(12:0/12:0)). As seen in Fig. 1A, a number of lipids exceeded the stringent threshold (FDR-adjusted adjusted p < 0.05 and effect size (|g|) > 1) level of difference between the acute and recovery phase samples, including 37 and 18 in the screening and validation sets, respectively (Fig. 1B). Then, lipids exceeding the threshold in both the screening and validation studies were extracted, which resulted in nine lipids being identified as candidate biomarkers (Fig. 1B), all of which were expressed at lower levels in the acute phase samples compared to that in the recovery phase samples. Since six of the nine candidate biomarkers were LPCs, we focused on LPCs in subsequent experiments.

Figure 1
figure 1

Lipidomics screening of candidate DILD biomarkers. (A) Individual lipid plot of false discovery rate (FDR) and effect size (g) from lipidomics analyses comparing acute and recovery phase DILD patients. (B) Venn diagram of significantly different lipids between acute and recovery phase DILD patients from the screening and validation cohorts.

Validation of LPCs as DILD biomarkers

To validate LPCs as candidate biomarkers for DILD, we established and validated the LC/MS biomarker assay to determine LPC plasma concentrations. Due to the availability of standards and the larger effect size of saturated/monounsaturated LPCs, we targeted the following eight LPCs: LPC(14:0), LPC(15:0), LPC(16:0), LPC(17:0), LPC(18:0), LPC(18:1), LPC(19:0), and LPC(20:0), based on the availability of analytical standards. The validated assay parameters (Supplementary information 3) were compatible with the acceptance criteria for bioanalytical drug validation methods (as described in Supplementary information 1). A total of 102 patients with DILD (102 acute phase; 59 recovery phase) and 31 DILD-tolerant patients were recruited for the validation assay, and their clinical characteristics are summarized in Table 1 and listed individually in Supplementary information 4.

Table 1 Summary of clinical characteristics of DILD and DILD-tolerant patients used for LPC biomarker assay.

Next, we compared plasma LPC concentrations between acute and recovery phase of DILD patients and DILD-tolerant patients (Fig. 2A). All eight LPCs were expressed at significantly lower levels in patients with DILD in the acute phase than in the recovery phase or in DILD-tolerant patients. ROC analysis was then performed to evaluate the discriminative performance of the individual LPCs between acute and recovery phase DILD patients (Fig. 2B and Supplementary information 6A) and between acute phase DILD patients and DILD-tolerant patients (Fig. 2C and Supplementary information 6B). The calculated AUC was highest for LPC(14:0) in both comparisons (0.813 for recovery phase; 0.811 for DILD-tolerant), with Yoden index cut-off values of 277.2 ng/mL (recovery phase) and 273.9 ng/mL (DILD-tolerant). Other LPCs scored AUCs of 0.737–0.783 for recovery phase patients and 0.738–0.763 for DILD-tolerant patients. The correlation between LPC(14:0) and the other LPCs in DILD patients was strong, with correlation coefficient (r) ranging from 0.65 to 0.86 (data not shown). This indicates that LPCs show similar trends in response to DILD, and that scoring models with multiple LPCs would not achieve better discrimination. Moreover, the calculated AUCs revealed that LPC(14:0) performed better as a biomarker for DILD than KL-6 and SP-D (Supplementary information 7), suggesting that LPC(14:0) could be a novel, high-performance biomarker for DILD.

Figure 2
figure 2

Validation of LPCs as DILD biomarkers. (A) Individual plot of plasma LPC concentrations in acute and recovery phase DILD and DILD-tolerant patients. ***p < 0.001 vs. acute phase. (B) ROC curve of LPC(14:0) concentrations between acute and recovery phase DILD patients. (C) ROC curve of LPC(14:0) concentrations between acute phase DILD patients and DILD-tolerant patients.

Association between LPC(14:0) and the clinical characteristics of patients with DILD

Plasma LPC(14:0) was next characterized in several aspects as a DILD marker. First, we investigated the relationship between DILD patterns and LPC(14:0) concentrations in acute and recovery of DILD patients. As shown in Fig. 3A, the DAD pattern had the strongest effect on plasma LPC(14:0) concentrations, but this effect was observed for the other DILD patterns. For the causal drug, the drug groups that belonged to the fourth level of the ATC code were selected to ensure statistical reliability, and the results with number of prescribed patients over 10 were shown (Fig. 3B). All groups of causal drug demonstrated a considerable impact on the plasma LPC(14:0) concentrations. A comparison of the acute phase and DILD-tolerant patients treated with the same group of causal drugs revealed that plasma LPC(14:0) concentrations were decreased due to DILD rather than the causal drugs (Supplementary information 8).

Figure 3
figure 3

Association of LPC(14:0) with DILD patterns, causal drugs, symptom severities, and other DILD biomarkers. (A) Plasma LPC(14:0) concentrations for each DILD pattern in acute and recovery phase DILD patients. ***p < 0.001 vs. recovery phase. (B) Plasma LPC(14:0) concentrations in acute phase DILD patients treated with specific causal drugs and recovery phase DILD patients. L01BC: pyrimidine analogs; L01CD: taxanes; L01XC: monoclonal antibodies; L01XE: protein kinase inhibitors. ***p < 0.001 vs. recovery phase. (C) Plasma LPC(14:0) concentrations with cough and breathlessness scores, body temperatures, and SpO2 levels. (D) Plasma LPC(14:0) concentrations in acute phase DILD patients with or without oxygen administration. ***p < 0.001. (E) Plasma LPC(14:0) concentrations with other DILD biomarkers.

Furthermore, we examined the association between plasma LPC(14:0) concentration and severity of symptoms in DILD, by assessing parameters such as cough, breathlessness, body temperature, oxygen administration, and SpO2 levels. The severity of cough and breathlessness were evaluated by the attending physician based on the classification criteria for severity of adverse drug reactions22 and scored from 0 (no symptoms) to 3 (severe symptoms). SpO2 levels were only evaluated in patients without oxygen administration, since oxygen administration results in amelioration of SpO2 levels. As shown in Fig. 3C, all severity scores of DILD symptoms correlated significantly with plasma LPC(14:0) levels, with breathlessness levels exceeding the absolute correlation coefficient of > 0.4. In addition, patients with oxygen administration demonstrated significantly lower plasma LPC(14:0) concentrations than those without (Fig. 3D). These results suggest that plasma LPC(14:0) concentration is associated with those symptom severities in DILD that reflect disease severeity.

Along with DILD pattern, causal drugs, and symptom severities, we examined the association between plasma LPC(14:0) concentrations and other clinical chemistry/biomarker levels in patients with and without DILD. As shown in Fig. 3E, plasma LPC(14:0) concentration correlated significantly and negatively with blood C-reactive protein (CRP) levels (r = -0.60). Significant correlations were also observed with other biomarker levels, except KL-6, but with absolute correlation coefficients of < 0.4.

Using multiple regression analysis, we examined the effects of the physical and clinical background characteristics of the patients, which may modulate the blood lipid profiles23,24,25. The physical characteristics examined were sex, age, body mass index (BMI), and smoking experience, and the clinical characteristics were chest radiation and associated diseases (LuCa, other cancers, high blood pressure, diabetes, lipid disorder, and heart disease). As shown in Fig. 4A, DILD was the primary factor contributing to plasma LPC(14:0) concentrations (p = 9.7E-13), and BMI was the only other factor that contributed significantly (p = 6.3E-4; Fig. 4B). To exclude the effect of BMI on plasma LPC(14:0) concentrations and evaluate the effect of DILD, we adjusted plasma LPC(14:0) concentrations for BMI using the beta obtained from multiple regression analysis and compared concentrations between acute and recovery phase DILD patients or DILD-tolerant patients. As shown in Fig. 4C, adjusted LPC(14:0) concentrations were significantly lower in patients with acute phase DILD than that in the recovery phase or DILD-tolerant patients. ROC analysis between acute and recovery phase DILD patients (Fig. 4D) and between acute phase DILD patients and DILD-tolerant patients (Fig. 4E) yielded AUCs of 0.807 for recovery phase patients and 0.801 for DILD-tolerant patients, consistent with unadjusted LPC(14:0) concentrations. These results suggest that BMI has a limited effect on plasma LPC(14:0) concentration and does not affect its application as a biomarker for DILD.

Figure 4
figure 4

Association between LPC(14:0) and patient background characteristics. (A) Multivariable logistic regression analysis of the characteristic contribution to plasma LPC concentrations. (B) Plasma LPC(14:0) concentrations with BMI. (C) BMI-adjusted plasma LPC(14:0) concentrations in acute and recovery phase DILD and DILD-tolerant patients. ***p < 0.001 vs. acute phase. (D) ROC curve of BMI-adjusted LPC(14:0) concentrations between acute and recovery phase DILD patients. (E) ROC curve of BMI-adjusted LPC(14:0) between acute phase and DILD-tolerant patients.

Ability of LPC(14:0) to discriminate between DILD, other lung diseases, and healthy volunteers

Finally, we examined the ability of plasma LPC(14:0) concentrations to discriminate between DILD and other lung diseases, LuCa (n = 68), BaPn (n = 10), NoMy (n = 20), IIP (n = 39), CTD (n = 23), COPD (n = 13), and BrAs (n = 12), or healthy volunteers (n = 90). Patient background characteristics are summarized in Table 2. Although there were several differences in background characteristics between patients with DILD and other lung diseases, multiple regression analyses demonstrated that background characteristics had a negligible effect (Fig. 4A). As shown in Fig. 5A, plasma LPC(14:0) concentrations were significantly lower in patients with acute phase DILD than in all other lung disease groups, except for BaPn and NoMy. ROC analysis (Fig. 5B) revealed that LPC(14:0) had substantial discriminative performance (> 0.7) between patients with acute phase DILD and other lung diseases, except for BaPn, and healthy volunteers (0.749 for LuCa, 0.563 for BaPn, 0.708 for NoMy, 0.750 for IIP, 0.818 for CTD, 0.744 for COPD, 0.788 for BrAs, and 0.798 for healthy volunteers). In addition, the AUCs against LuCa, IIP, and CTD revealed that LPC(14:0) performed better than KL-6 and SP-D (Supplementary information 9). The determined Yoden index cut-off values for LPC(14:0) were 368.1 ng/mL for LuCa, 274.3 ng/mL for IIP, and 392.6 ng/mL for CTD. These findings suggest that LPC(14:0) is an effective biomarker that can distinguish DILD from lung diseases, such as LuCa, IIP, and CTD, better than classical biomarkers.

Table 2 Summary of background information of patients with other lung diseases and healthy volunteers used for LPC biomarker assay.
Figure 5
figure 5

Ability of LPC(14:0) to discriminate DILD from other lung diseases and healthy volunteers. (A) Plasma LPC(14:0) concentrations in acute phase DILD patients, other lung diseases, and healthy volunteers. ***p < 0.001, *p < 0.05 vs. acute phase DILD. LuCa; lung cancer, BaPn; bacterial pneumonia, NoMy; nontuberculous mycobacteriosis, IIP; idiopathic interstitial pneumonia, CTD; lung disease associated with connective tissue disease, COPD; chronic obstructive pulmonary disease, BrAs; bronchial asthma, Healthy; healthy volunteer. (B) ROC curve of LPC (14:0) concentrations between acute phase DILD patients and other groups.

Discussion

This study identified and characterized DILD biomarkers from patients with DILD, DILD-tolerant patients, patients with other lung diseases, and healthy volunteers. Lipidomics analysis screening and validation using patients with acute and recovery phase DILD revealed a systemic decrease in LPC class molecules, with larger effect size of saturated/monounsaturated fatty acid side chains in acute phase patients. Further analysis of plasma concentrations demonstrated that LPCs were present at significantly lower concentrations in the acute phase DILD, with LPC(14:0) exceeding the performance of the classical biomarkers KL-6 and SP-D. Notably, LPC(14:0) had no apparent association with causal drugs, or patient backgrounds and was associated with disease severity and CRP levels, unlike with KL-6 or SP-D levels. Furthermore, LPC(14:0) concentration can discriminate between patients with DILD and those with LuCa, IIP, CTD, COPD, and BrAs, exceeding the performance of KL-6 and SP-D biomarkers in LuCa, IIP, and CTD cases. Therefore, this study demonstrated that LPCs, particularly LPC(14:0), could be a novel and effective biomarker for DILD.

Blood LPC levels have previously been proposed as candidate biomarkers for acute inflammatory diseases. For instance, serum LPC levels are decreased in patients with community-acquired pneumonia and are associated with their prognosis and mortality26,27. Consistently, we found that plasma LPC(14:0) concentrations were decreased in patients with BaPn. LPCs have also been proposed as useful biomarkers for diagnosing sepsis and predicting patient mortality28, while metabolomics and lipidomics screening have recently revealed decreased serum LPC levels in patients with COVID-19 with lung injury29,30. Our results further strengthen the association between blood LPC levels and acute inflammation. Moreover, LPCs have been reported as candidate biomarkers for cancer and diabetes, with which chronic inflammation is associated25,31,32. However, we found that plasma LPC(14:0) concentrations were higher in patients with the chronic inflammatory lung diseases IIP, CTD, COPD, and BrAs, than that in patients with DILD. In addition, we demonstrated the negligible impact of cancers and diabetes on blood LPC(14:0) concentrations. Therefore, the association between chronic inflammation and blood LPC levels may be to a lesser extent than that of acute inflammation and blood LPC levels.

Notably, the estimated cut-off values of LPC(14:0) for discriminating between acute DILD and IIP were compatible with recovery phase DILD, or DILD tolerance, which was approximately 275 ng/mL. Consequently, LPC(14:0) could be useful for specifically diagnosing DILD compared to IIP, which have different treatment strategies. For LuCa and CTD, the estimated cut-off values of LPC(14:0) were around 380 ng/mL, which is over 100 ng/mL above the cut-off for DILD diagnosis. Whilst LPC(14:0) was not applicable for discriminating DILD from BaPn, culture-, antigen-, or gene-based tests for bacteria can be used for diagnosing BaPn.

To date, no direct mechanism-based evidence of decreased plasma LPC concentrations has been provided in patients with DILD. However, the roles of LPCs and their metabolizing enzyme autotaxin in inflammation and immune response have been well characterized33,34. Autotaxin acts as phospholipase D to cleave choline from LPC and produce lysophosphatidic acid (LPA), which activates various LPA receptors to induce inflammatory and anti-inflammatory responses33,34. Serum autotoxin levels have been reported to correlate with the severity and mortality of acute respiratory distress syndrome35, and increased serum autotoxin levels have been detected in patients with severe COVID-1936. Plasma autotaxin levels also correlate with mortality in patients with severe sepsis37, suggesting that autotaxin may play a role in systemic hyper-inflammation and decreased plasma LPC concentrations in DILD. However, autotaxin also exerts important effects in chronic inflammatory diseases, such as idiopathic pulmonary fibrosis (IPF)34, and is present at increased levels in the bronchoalveolar lavage fluid of IPF models38. Since our study demonstrated a lesser impact of chronic inflammation on plasma LPC levels than that of acute inflammation, the role of autotaxin on the plasma LPC levels during acute and chronic inflammation may be different and/or could involve other factors.

Along with the mechanism regulating the decreased in plasma LPC concentrations in patients with DILD, there is no direct evidence for the role of decreased LPCs themselves on DILD pathogenesis. However, LPC has been reported to activate multiple signaling pathways involved in oxidative stress and inflammatory responses39. The roles of LPCs in endothelial cells, including vascular endothelium, and immune cells were characterized in association with worsened inflammation. For example, LPCs induce the production of cytokines in monocyte recruitment, cytotoxicity, apoptosis, and oxidative stress in endothelial cells40,41,42,43 as well as those involved in macrophage and B cell activation, and apoptosis in immune cells44,45,46. Therefore, one possible role of reduced LPCs in DILD pathogenesis is the counteraction of lung inflammation.

Despite our novel findings, this study has several limitations. First, the sample sizes for patients with DILD and related lung diseases were relatively small for verified clinical assessment. Second, although we demonstrated that causal drugs had a limited impact on plasma LPC concentrations, we were unable to fully exclude the possibility that the primary effects of causal drugs and their associated diseases may have influenced our outcomes. Third, we recruited patients with DILD from four core hospitals using the same sampling protocol; however, hospital-to-hospital variation in sample preparation may have yielded slightly different plasma LPC concentrations. Fourth, we did not control the alcohol and food habits of the patients included in this study, which might have affected postprandial metabolite responses since alcohol intake and a high-fat diet have been shown to alter the expression of hepatic genes, including those related to energy homeostasis and diet metabolism47,48. Fifth, this study only included Japanese patients, and thus our findings may not be applicable to other ethnic groups. Finally, we did not examine the effects of different radiological patterns, such as the distributed area of the affected lung lesions, which may be associated with different inflammatory pathways as well as lipid alteration. Therefore, these limitations should be addressed in further studies before applying LPCs as practical biomarkers for diagnosing DILD in a clinical setting.

In conclusion, this study identified LPC(14:0) as a general biomarker of DILD, which was superior to KL-6 and SP-D when discriminating between acute and recovery phase DILD and tolerant controls. In addition, we characterized an association between LPC(14:0) and disease severity and demonstrated its ability to discriminate between DILD and IIP or CTD. Together, the findings of this study suggest that LPC(14:0) could help diagnose DILD before and during the administration of drugs indicated to treat the onset of DILD or in patients with suspected DILD having shortness of breath or dyspnea.