Machine learning using clinical data at baseline predicts the efficacy of vedolizumab at week 22 in patients with ulcerative colitis

Miyoshi, Jun; Maeda, Tsubasa; Matsuoka, Katsuyoshi; Saito, Daisuke; Miyoshi, Sawako; Matsuura, Minoru; Okamoto, Susumu; Tamura, Satoshi; Hisamatsu, Tadakazu

doi:10.1038/s41598-021-96019-x


Article
Open access
Published: 12 August 2021

Scientific Reports volume 11, Article number: 16440 (2021)

Abstract

Predicting the response of patients with ulcerative colitis (UC) to a biologic such as vedolizumab (VDZ) before administration is an unmet need for optimizing individual patient treatment. We hypothesized that the machine-learning approach with daily clinical information can be a new, promising strategy for developing a drug-efficacy prediction tool. Random forest with grid search and cross-validation was employed in Cohort 1 to determine the contribution of clinical features at baseline (week 0) to steroid-free clinical remission (SFCR) with VDZ at week 22. Among 49 clinical features including sex, age, height, body weight, BMI, disease duration/phenotype, treatment history, clinical activity, endoscopic activity, and blood test items, the top eight features (partial Mayo score, MCH, BMI, BUN, concomitant use of AZA, lymphocyte fraction, height, and CRP) were selected for logistic regression to develop a prediction model for SFCR at week 22. In the validation using the external Cohort 2, the positive and negative predictive values of the prediction model were 54.5% and 92.3%, respectively. The prediction tool appeared useful for identifying patients with UC who would not achieve SFCR at week 22 during VDZ therapy. This study provides a proof-of-concept that machine learning using real-world data could permit personalized treatment for UC.

Introduction

Ulcerative colitis (UC) is one of the major phenotypes of inflammatory bowel disease (IBD), and it is characterized by chronic colonic inflammation with periods of remission and relapse. Although the pathophysiology of UC remains unclear, more patients have been able to achieve remission with the improvement of therapeutic options and strategies, which has led to better long-term prognosis^1,2,3,4. At present, various molecular targeted drugs, such as calcineurin inhibitors [cyclosporine A and tacrolimus (TAC)], anti-tumor necrosis factor alpha (TNFα) antibodies [adalimumab (ADA), golimumab, and infliximab (IFX)], an anti-α₄β₇ integrin antibody [vedolizumab (VDZ)], an anti-IL12/23p40 antibody [ustekinumab (UST)], and a Janus kinase (JAK) inhibitor [tofacitinib (TOF)], are used in particular for treating patients with steroid-dependent/refractory UC. Meanwhile, in most clinical settings, it is challenging for physicians to identify the most effective molecular targeted drug for individual patients. When a physician considers starting a molecular targeted medication, the patient has active disease that requires additional therapeutic intervention; that is, a medication must be selected appropriately and without delay. In general, there is no guide for selecting the most suitable molecular targeted drug for the individual patient at present. This lack of a guide affects both patient outcomes and medical costs. Molecular targeted drugs are far more expensive than conventional medications such as 5-aminosalicylic acid (5-ASA), immunomodulators [e.g., azathioprine (AZA)], and steroids. The use of ineffective molecular targeted medications can represent a socioeconomic burden. Thus, predicting the efficacy of a molecular targeted medication before administration is crucial in this molecular-targeted therapy era.

The real-world pooled outcome of VDZ demonstrated that the rates of clinical response and remission at week 14 were 51% and 31%, respectively⁵. Given these rates, other medications might be more effective than VDZ for some patients with UC, and the prediction of VDZ efficacy in advance could provide these patients with an opportunity to initially receive another therapy. Several studies investigated the predictors of response to VDZ in UC^6,7. Among clinical factors at baseline, serum C-reactive protein (CRP) levels^8,9, serum albumin concentrations⁷, the Mayo Clinic score⁹, previous exposure to anti-TNF agents^7,10, disease duration⁷, and endoscopic activity⁷ have been reported to be associated with the clinical efficacy of VDZ in patients with UC. These previous studies employed statistical methods such as univariate and multivariate analyses to search for the predictors. In this study, we hypothesized that a new approach using machine learning could illuminate predictive factors of VDZ efficacy for UC that have not been detected as statistically significant using the conventional statistical approaches. In the present study, we investigated clinical features at baseline (week 0) that affect steroid-free clinical remission (SFCR) during VDZ therapy at week 22 and developed a prediction tool. Random forest (RF)¹¹ is an ensemble learning algorithm generating decision trees based on the training data. RF can also estimate the relative importance score for each feature. That is, RF allows the analysis of many factors simultaneously and provides insights into the contribution of each factor to the eventual outcome (i.e., achievement vs. no achievement of SFCR at week 22). We employed this method for clinical data at week 0 for patients with UC who started VDZ treatment for the induction of remission (training cohort), and the extracted factors were used to develop a prediction tool. 
The predictive accuracy of the tool was evaluated with another data set of patients who received VDZ for UC (test cohort).

The merit of this study is that it attempts to establish a prediction model based on generally available clinical information collected in daily practice. This is crucial for applying a machine learning-based prediction tool in the clinical setting. This pioneering work provides a proof-of-concept that the machine-learning approach can be a new strategy for investigating predictors of treatment efficacy in patients with UC and developing a prediction tool.

Methods

Study subjects

We retrospectively collected clinical data at baseline (week 0) and examined the clinical activity of UC at week 22 in 34 patients who (1) started VDZ at Kyorin University Hospital between September 2019 and April 2020 for the induction of remission, (2) underwent blood testing at week 0, and (3) underwent examination at Kyorin University Hospital at week 22 (training cohort, Cohort 1). As an extra-facility cohort, 35 patients with UC at Toho University Sakura Medical Center who (1) started VDZ between January 2019 and June 2020 for the induction of remission, (2) underwent blood testing at week 0, and (3) underwent examination at Toho University Sakura Medical Center at week 22 were analyzed (Cohort 2). The diagnosis of UC was confirmed using the clinical practice guidelines for IBD of The Japanese Society of Gastroenterology¹². VDZ treatment for the induction of remission was defined as VDZ started for active UC (Lichtiger index¹³ was ≥ 5).

Assessment of clinical efficacy

Clinical response at week 22 was assessed using the Lichtiger index¹³. Clinical remission was defined as a Lichtiger index of 4 or lower. Subjects who terminated VDZ treatment (switching to other medications) or needed surgery because of insufficient control of UC disease activity before week 22 were regarded as not achieving clinical remission at week 22.
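
The outcome rule described above can be written as a short function. The following is only a minimal sketch of that rule: the class and field names are hypothetical, and the "steroid-free" component of SFCR is not modeled because its operational definition is not spelled out in this section.

```python
from dataclasses import dataclass

# Cutoff taken from the text: a Lichtiger index of 4 or lower is clinical remission.
REMISSION_CUTOFF = 4


@dataclass
class Week22Status:
    """Hypothetical container for one patient's status at week 22."""
    lichtiger_index: int
    stopped_vdz: bool = False   # switched to another medication before week 22
    had_surgery: bool = False   # surgery for insufficient disease control before week 22


def in_clinical_remission(status: Week22Status) -> bool:
    """Apply the rule in the text: early termination or surgery counts as no remission."""
    if status.stopped_vdz or status.had_surgery:
        return False
    return status.lichtiger_index <= REMISSION_CUTOFF


print(in_clinical_remission(Week22Status(lichtiger_index=3)))                    # on VDZ, low score
print(in_clinical_remission(Week22Status(lichtiger_index=3, stopped_vdz=True)))  # stopped VDZ
print(in_clinical_remission(Week22Status(lichtiger_index=6)))                    # still active
```

Treating early termination and surgery as failures before looking at the score mirrors the order in which the text states the rule.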

Machine learning and prediction tools

To investigate clinical features related to SFCR during VDZ treatment at week 22, the data of 49 clinical features at week 0 were obtained from the Kyorin medical record system for patients in Cohort 1. The examined features included sex, age, height, body weight, body mass index (BMI), disease duration, disease type (inflammation distribution), treatment history for UC, clinical activity, endoscopic activity, and 25 blood test items (Table 1). The blood test was performed on the day of the first VDZ dose. Colonoscopy performed within 3 months before starting VDZ therapy was employed to obtain the baseline endoscopic findings. Categorical data were replaced with dummy variables. Missing values were imputed with the average value and the mode value for numerical data and categorical data, respectively. The data of patients in Cohort 2 were similarly collected from the Toho Sakura Medical Center medical record system. The standardized values of Cohort 1 were used for RF. RF was employed to develop a high-accuracy prediction model and identify which feature contributed to the prediction in the present study. RF is an ensemble technique using decision trees. In training, the RF algorithm creates multiple trees, and each tree is trained on the bootstrapped samples of the training data. Since the number of patients was limited in this study, RF was initialized using random values, and the training of RF was repeated 50 times. The contribution of each feature (49 clinical features, Table 1) to SFCR at week 22 was obtained by calculating the average value. When training the RF, the hyperparameters (number of trees and maximum depth of the tree) were automatically optimized via grid search and cross-validation. Grid search is a method for obtaining optimal hyperparameters in an algorithm. This performs a complete search over a given subset of the hyperparameter space of the training algorithm. 
The best hyperparameters are estimated according to the evaluation score of the validation data. Cross-validation is a resampling procedure for evaluating machine-learning models on a limited data sample. The general procedure is as follows: (1) split the dataset into k groups; (2) for each group, (i) select the group as the validation dataset, (ii) use the remaining k − 1 groups as the training dataset, and (iii) fit a model on the training set and evaluate it on the validation set; and (3) calculate the average of the k evaluation scores. The final prediction result of the RF is obtained from the mode of the predictions of the individual decision trees. The feature importance is determined according to the extent to which decision tree nodes using each feature reduce impurity across all trees in the forest. Next, logistic regression was used to develop a prediction tool in this study. Logistic regression is a classification algorithm for assigning each observation to one of a discrete set of classes. We inputted eight clinical features at week 0 that were selected as features with high contributions based on RF findings to predict the achievement/no achievement of SFCR at week 22. Logistic regression finally outputted the probability that an observation vector belongs to a particular class using the logistic sigmoid function. The prediction accuracy of the model was assessed using the data of Cohort 2. We performed the machine learning in Python using the scikit-learn package.
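
The workflow described in this subsection (RF tuned by grid search with cross-validation, repeated with different random seeds, importances averaged, then logistic regression on the top eight features) might look roughly like the sketch below in scikit-learn. This is not the authors' code. The data are synthetic, and the grid values, the number of folds, and the reduced number of repeats are all assumptions made for illustration, because the paper does not report them.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

# Synthetic stand-in for a 34 x 49 standardized training matrix (NOT patient data).
X, y = make_classification(n_samples=34, n_features=49, n_informative=8, random_state=0)

# Assumed grid: the paper tunes the number of trees and the maximum depth
# but does not list the candidate values.
param_grid = {"n_estimators": [50, 100], "max_depth": [2, 4]}

n_repeats = 5  # the paper repeats training 50 times; reduced here to keep the sketch fast
importances = []
for seed in range(n_repeats):
    # Grid search with 3-fold cross-validation (the fold count is an assumption).
    search = GridSearchCV(RandomForestClassifier(random_state=seed), param_grid, cv=3)
    search.fit(X, y)
    importances.append(search.best_estimator_.feature_importances_)

# Average the impurity-based importances over the repeated runs.
mean_importance = np.mean(importances, axis=0)

# Indices of the eight features with the highest average importance.
top8 = np.argsort(mean_importance)[::-1][:8]

# Logistic regression on the selected features, classified at a 0.5 threshold.
clf = LogisticRegression(max_iter=1000).fit(X[:, top8], y)
proba = clf.predict_proba(X[:, top8])[:, 1]
pred = (proba >= 0.5).astype(int)

print("top-8 feature indices:", top8.tolist())
print("training accuracy:", float((pred == y).mean()))
```

Averaging importances over several seeds is meant to dampen the run-to-run variability that a small cohort produces in any single forest.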

Table 1 Baseline clinical features employed for machine learning.
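
Before modeling, the features in Table 1 were preprocessed as described above: dummy coding for categorical data, mean or mode imputation, and standardization. A minimal standard-library sketch of those three steps follows. The toy values are hypothetical, and the use of the population standard deviation is an assumption, since the paper does not state which form was used.

```python
from statistics import mean, mode, pstdev


def impute_numeric(values):
    """Replace missing entries (None) with the mean of the observed values."""
    fill = mean(v for v in values if v is not None)
    return [fill if v is None else v for v in values]


def impute_categorical(values):
    """Replace missing entries (None) with the most frequent observed value."""
    fill = mode(v for v in values if v is not None)
    return [fill if v is None else v for v in values]


def dummy_encode(values, positive):
    """Map a two-level categorical feature to a 0/1 dummy variable."""
    return [1 if v == positive else 0 for v in values]


def standardize(values):
    """Z-score using the population standard deviation (an assumption)."""
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma for v in values]


# Hypothetical toy values, not patient data.
crp = [0.5, None, 2.5, 3.0]
aza = ["yes", "no", None, "no"]

crp_filled = impute_numeric(crp)
aza_dummy = dummy_encode(impute_categorical(aza), positive="yes")
crp_z = standardize(crp_filled)

print(crp_filled, aza_dummy)
print([round(z, 3) for z in crp_z])
```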

Ethical considerations

This study was conducted in accordance with the guidelines of the Declaration of Helsinki. This study was approved by the Institutional Ethics Committees of Kyorin University School of Medicine (Approval Number 1364) and Toho University Sakura Medical Center (Approval Number S20071). Informed consent was obtained from subjects (also from a parent when a patient was younger than 18 years) prior to the study.

Results

Training dataset

The clinical demographics of the 34 patients in Cohort 1 at baseline are presented in Table 2. The cohort consisted of 25 men and 9 women with a median age of 37 years (range, 17–92 years). The median disease duration was 5.0 years (range, 0.1–31.0 years). The disease diagnosis was total colitis in 28 patients and left-sided colitis in six patients. Thirty-three patients (97.1%) had been treated with 5-ASA before starting VDZ, seven of whom stopped 5-ASA treatment because of intolerance. Thirteen patients (38.2%) had previously received AZA before VDZ, three of whom stopped AZA therapy because of adverse events. In total, 29 (85.3%), 12 (35.3%), 1 (2.9%), 3 (8.8%), and 3 (8.8%) patients had been treated with prednisolone (PSL), anti-TNFα agents, TOF, TAC, and granulocyte and monocyte apheresis (GMA), respectively, before starting VDZ. No patient stopped these treatments because of adverse events. When starting VDZ (week 0), 21 (61.8%), 8 (23.5%), and 8 (23.5%) patients were using 5-ASA, AZA, and PSL, respectively. The clinical disease activity at baseline was assessed using the Lichtiger index and partial Mayo (pMayo) score for all patients (Table 2). Colonoscopy was performed at baseline in 31 patients. Endoscopic disease activity was assessed using the Mayo endoscopic subscore (MES) and ulcerative colitis endoscopic index of severity (UCEIS) (Table 2). The results of 25 blood test items at week 0 are presented in Table 2. At week 22, among the 34 patients, 18 patients (52.9%; 12 males and 6 females) achieved SFCR with VDZ. No patient stopped VDZ because of adverse events.

Table 2 Clinical demographics of Cohort 1 (34 patients).

Test dataset

The clinical demographics of the 35 patients in Cohort 2 at baseline are presented in Table 3. This cohort included 22 men and 13 women with a median age of 42 years (range, 17–90 years). The median disease duration was 7.2 years (range, 0.6–38.0 years). The disease diagnosis was total colitis in 26 patients and left-sided colitis in nine patients. A history of 5-ASA, AZA, PSL, anti-TNFα agent, TOF, TAC, and GMA treatment before VDZ therapy was documented in 35 (100%), 16 (45.7%), 34 (97.1%), 20 (57.1%), 6 (17.1%), 9 (25.7%), and 13 (37.1%) patients, respectively. No patient stopped treatment because of adverse events. When starting VDZ (week 0), 29 (82.9%), 4 (11.4%), and 11 (31.4%) patients were using 5-ASA, AZA, and PSL, respectively. The clinical disease activity at baseline was assessed using the Lichtiger index and pMayo score for all patients (Table 3). Colonoscopy was performed at baseline in 14 patients, and their endoscopic disease activity was assessed using MES and UCEIS (Table 3). SFCR at week 22 was achieved in 13 patients (37.1%; seven men and six women). No patient stopped VDZ because of adverse events.

Table 3 Clinical demographics of Cohort 2 (35 patients).

Development of prediction tool for vedolizumab efficacy

RF using the data of 49 clinical features at baseline for patients in Cohort 1 was performed, and the contribution of each factor to SFCR at week 22 was determined (Fig. 1). The 10 clinical features with the highest contribution were the pMayo score, mean corpuscular hemoglobin (MCH) concentration (pg), BMI, blood urea nitrogen (BUN) concentration (mg/dL), concomitant use of AZA (+ / −), lymphocyte (Lympho) fraction (%), height (cm), C-reactive protein (CRP) concentration (mg/dL), total cholesterol (TCho) concentration (mg/dL), and neutrophil fraction (%). These features were employed for logistic regression to develop a prediction model. The predictive accuracy of the logistic regression models (achievement of SFCR at week 22: y = 1, no achievement of SFCR at week 22: y = 0, threshold: y = 0.5) in Cohorts 1 and 2 is presented in Table 4. When the top 8 features (pMayo score, MCH, BMI, BUN, concomitant use of AZA, Lympho fraction, height, and CRP) were employed, the predictive accuracy was 100% in Cohort 1, versus 68.6% in Cohort 2. The equation of logistic regression using the features was as follows:

$$ y = \frac{1}{1 + e^{-x}} $$

$$ \begin{aligned} x ={} & a_{0} \times [\text{standardized pMayo score}] + a_{1} \times [\text{standardized MCH}] + a_{2} \times [\text{standardized BMI}] \\ & + a_{3} \times [\text{standardized BUN}] + a_{4} \times [\text{concomitant use of AZA}] + a_{5} \times [\text{standardized Lympho fraction}] \\ & + a_{6} \times [\text{standardized height}] + a_{7} \times [\text{standardized CRP}] - 0.27955142 \end{aligned} $$

$$ \begin{aligned} a_{0} &= -2.096161390278958, & a_{1} &= 1.0592561253594117, \\ a_{2} &= -0.34465735086320304, & a_{3} &= 2.705485091049323, \\ a_{4} &= 6.718131278058346, & a_{5} &= 1.563838679767709, \\ a_{6} &= -2.523013372748039, & a_{7} &= 1.9649107396733663 \end{aligned} $$

$$ \begin{aligned} \text{Standardized pMayo score} &= (\text{pMayo score} - 6.235294118)/1.284725275 \\ \text{Standardized MCH} &= (\text{MCH} - 29.84411765)/2.896365997 \\ \text{Standardized BMI} &= (\text{BMI} - 20.07734956)/3.472063638 \\ \text{Standardized BUN} &= (\text{BUN} - 9.825806452)/4.909567207 \\ \text{Concomitant use of AZA} &= 0\ (\text{No}) \text{ or } 1\ (\text{Yes}) \\ \text{Standardized Lympho fraction} &= (\text{Lympho fraction} - 19.515625)/7.997308507 \\ \text{Standardized height} &= (\text{height} - 164.3909091)/9.046524986 \\ \text{Standardized CRP} &= (\text{CRP} - 1.779375)/2.448675583 \end{aligned} $$
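
To make the equations above concrete, the sketch below transcribes the published coefficients and standardization constants into a small Python function. The function name, the dictionary keys, and the example patient are hypothetical; units follow the text (MCH in pg, BUN and CRP in mg/dL, Lympho fraction in %, height in cm).

```python
import math

INTERCEPT = -0.27955142
AZA_COEFFICIENT = 6.718131278058346  # a4, applied to the 0/1 AZA indicator

# name: (coefficient, mean used for standardization, SD used for standardization)
FEATURES = {
    "pmayo":  (-2.096161390278958, 6.235294118, 1.284725275),   # a0
    "mch":    (1.0592561253594117, 29.84411765, 2.896365997),   # a1
    "bmi":    (-0.34465735086320304, 20.07734956, 3.472063638), # a2
    "bun":    (2.705485091049323, 9.825806452, 4.909567207),    # a3
    "lympho": (1.563838679767709, 19.515625, 7.997308507),      # a5
    "height": (-2.523013372748039, 164.3909091, 9.046524986),   # a6
    "crp":    (1.9649107396733663, 1.779375, 2.448675583),      # a7
}


def sfcr_probability(values: dict, aza: bool) -> float:
    """Return y, the model output for SFCR at week 22 (y >= 0.5 predicts achievement)."""
    x = INTERCEPT + AZA_COEFFICIENT * (1 if aza else 0)
    for name, (coef, mu, sd) in FEATURES.items():
        x += coef * (values[name] - mu) / sd
    return 1.0 / (1.0 + math.exp(-x))


# Hypothetical patient whose values equal the standardization means,
# so every standardized term is zero and only the intercept and AZA term remain.
at_means = {name: mu for name, (_, mu, _) in FEATURES.items()}
p_no_aza = sfcr_probability(at_means, aza=False)
p_aza = sfcr_probability(at_means, aza=True)
print(round(p_no_aza, 3), round(p_aza, 3))
```

For this hypothetical patient, y is roughly 0.43 without AZA and above 0.99 with AZA, which illustrates how strongly the AZA term dominates the model.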

Table 4 Predictive accuracy of logistic regression models for steroid-free clinical remission at week 22 comprising the top 10 contributing clinical features.

The calculated value of y and the accuracy of the prediction in each patient in Cohorts 1 and 2 are presented in Supplemental Table 1. In Cohort 2, the positive predictive value (PPV; achievement of SFCR) and negative predictive value (NPV; no achievement of SFCR) were 54.5% and 92.3%, respectively (Table 5).
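
As a worked check of these figures, the sketch below computes PPV, NPV, and accuracy from a 2 × 2 table. The four counts are not quoted from Table 5; they are back-calculated here as an inference from the values reported in the text (35 patients, 13 of whom achieved SFCR, PPV 54.5%, NPV 92.3%, accuracy 68.6%), and they reproduce all three percentages.

```python
def predictive_values(tp: int, fp: int, tn: int, fn: int):
    """Return (PPV, NPV, accuracy) for a 2 x 2 confusion table."""
    ppv = tp / (tp + fp)                        # predicted SFCR that achieved SFCR
    npv = tn / (tn + fn)                        # predicted no SFCR that did not achieve it
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    return ppv, npv, accuracy


# Inferred counts (an assumption, see the lead-in): 12 + 1 = 13 patients with SFCR,
# 10 + 12 = 22 without, 35 in total.
ppv, npv, acc = predictive_values(tp=12, fp=10, tn=12, fn=1)
print(f"PPV {ppv:.1%}, NPV {npv:.1%}, accuracy {acc:.1%}")
# prints "PPV 54.5%, NPV 92.3%, accuracy 68.6%"
```

Under these inferred counts, only one of the 13 patients predicted not to achieve SFCR actually achieved it, which is what the high NPV expresses.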

Table 5 Predictive ability of the model for steroid-free clinical remission at week 22 in Cohort 2.

Discussion

In the present study, we analyzed 49 clinical features at week 0 using real-world data with the RF algorithm and determined the contribution of each clinical feature to the achievement of SFCR after 22 weeks of VDZ therapy. An advantage of RF is that we could investigate the contribution of these various clinical features in our cohorts despite the limited number of subjects. Generally, it is challenging to assess a large number of features in detail using statistical methodology, such as univariate and multivariate analyses, which requires a huge number of subjects. In addition, we believe that the “p-value” in statistical analyses needs to be interpreted carefully, although we acknowledge that statistical significance provides scientific insights. Some factors without statistical significance may potentially contribute to the outcome. Assessing the contribution of factors comprehensively with RF could be a promising approach for identifying predictors, particularly in a complex situation in which various factors can be involved, such as SFCR after VDZ treatment. Logistic regression was employed in this study to develop a prediction tool with clinical features using the eight largest contributors: pMayo score, MCH (pg), BMI, BUN (mg/dL), concomitant use of AZA, Lympho fraction (%), height (cm), and CRP (mg/dL). Our model revealed a high NPV (92.3%) for SFCR at week 22. This finding suggests that it would be better to consider other options if our model predicts VDZ will be ineffective for an individual patient. In the logistic regression model, the sign of the coefficient of each factor indicates whether the factor is positively or negatively associated with the outcome. Our logistic regression model illustrated that a lower pMayo score, higher MCH concentration, lower BMI, higher BUN concentration, concomitant use of AZA, higher Lympho fraction percentage, shorter height, and higher CRP concentration at week 0 were favorable for SFCR at week 22. 
We believe that interpreting the machine-learning results from medical and physiological viewpoints is crucial for considering the clinical significance of the model, and it could provide an opportunity to improve clinical practice.

A lower pMayo score indicates less clinical disease activity¹⁴. Higher MCH levels suggest that bleeding attributable to UC and iron, vitamin B₁₂, or folic acid deficiency are less severe. Since no patient had overt renal dysfunction in our cohorts, higher BUN levels are believed to reflect the intake of sources of nitrogen, i.e., patients’ dietary intake, particularly amino acids. Taken together, these factors imply that less disease activity and a better general and nutritional status are favorable for SFCR during VDZ therapy. In the present study, TCho (mg/dL) was one of the nine strongest contributors in RF, and when we included this feature in the logistic regression model, its coefficient was positive. Because TCho levels are decreased in response to malnutrition induced by active inflammation, this finding also suggests a better nutritional condition is positively related to VDZ efficacy. Barré et al. reviewed several reports on the predictors of VDZ treatment for UC and noted that severe disease activity at induction is a negative predictor⁶. Dulai et al. developed a tool to predict the response to VDZ including baseline moderate activity on endoscopy and albumin levels as positive predictors⁷. Our findings and interpretations of the pMayo score, MCH level, and BUN level appear compatible with these previous studies. Interestingly, lower BMI and shorter height were included as positive predictors in our prediction model. We speculate that these factors suggest a high VDZ concentration in the body because the dose was fixed as 300 mg/injection. In the GEMINI I study, a positive correlation was observed between VDZ serum concentrations and clinical response¹⁵. Samaan et al. reported that VDZ dose intensification was effective in patients with IBD with a suboptimal treatment response¹⁶. In a review by Barré et al., a low trough level of VDZ is cited as a negative predictor⁶. 
Taking these reports and our findings together, we speculate that adjusting the dose of VDZ depending on BMI could increase its efficacy. Meanwhile, caution may be needed when applying our prediction tool to patients with overt emaciation that far exceeds the range observed in the training dataset. It is noteworthy that the concomitant use of AZA was detected as a positive predictor, and the absolute value of its coefficient was the largest in our model; i.e., concomitant AZA use has a larger impact on SFCR at week 22 than the other features. Whereas the benefit of the combination of an immunomodulator and VDZ over VDZ monotherapy has not been established, our machine-learning approach identified a potentially beneficial effect of concomitant AZA use. We believe that the results for BMI/height and concomitant AZA use raise an important clinical question concerning the optimization of VDZ treatment for UC. Meanwhile, our finding that a higher Lympho fraction was related to SFCR during VDZ treatment suggests that VDZ responders could comprise a subgroup of UC with a specific pathophysiology. VDZ is a humanized monoclonal antibody directed toward α₄β₇ integrin. α₄β₇ integrin is expressed on the surface of lymphocytes, and it interacts with mucosal addressin cell adhesion molecule-1 (MAdCAM-1), which leads to the migration of lymphocytes to the intestine¹⁷. Based on this specific mechanism and our finding, we speculate that there could be a “lymphocyte-dominant” subgroup of UC and that VDZ exerts particular efficacy in such patients. The machine-learning approach would be useful for developing a prediction tool and obtaining clues for characterizing UC pathophysiology and subgrouping patients. Our model indicated that higher CRP levels were related to SFCR at week 22. This finding is incompatible with a previous report⁶, and it appears inconsistent with the favorability of a lower pMayo score. 
Among subjects with and without SFCR at week 22 in the training dataset, the mean ± standard error of the mean (SEM) CRP levels were 1.566 ± 0.6187 mg/dL and 2.054 ± 0.6328 mg/dL, respectively (p = 0.0532, Mann–Whitney U test). However, four subjects who achieved SFCR had a high CRP level (8.37 mg/dL, 6.43 mg/dL, 6.34 mg/dL, and 2.25 mg/dL), whereas the level was 0.02–1.88 mg/dL in the other patients who achieved SFCR (the normal CRP level is ≤ 0.14 mg/dL). Given that CRP levels were not high overall in Cohort 1, the results of these four patients might have affected the decision of the machine-learning algorithm.

We consider three future directions of the machine-learning approach for UC clinical data: (1) aiming for higher prediction accuracy, (2) developing prediction tools for various medications, and (3) searching for factors potentially involved in UC pathophysiology. Regarding (1), this study was limited by its small size. Larger training and test cohorts are needed to improve the prediction model and its accuracy. Additionally, it will be interesting to test other machine-learning methodologies, such as k-NN and support vector machine, and determine if those approaches can generate a better model. Point (2) is crucial for personalized medicine for UC. That is, if we have a prediction tool for each therapeutic intervention, we can run the multiple tools at baseline and determine which intervention is most suitable for individual patients. For instance, whereas Dulai et al. developed a prediction model for VDZ efficacy in patients with Crohn’s disease (CD)¹⁸, Alric et al. observed that the model could not predict the efficacy of UST in patients with CD¹⁹. Several cutting-edge studies are exploring the predictors of VDZ efficacy in patients with IBD. Ananthakrishnan et al. reported that the functional profile of the gut microbiome can be a predictor of VDZ efficacy at week 14 in patients with IBD²⁰. Rath et al. analyzed peripheral blood and colonic biopsy samples for CD4⁺ T cell subpopulations, cytokine production, and mRNA and protein expression including the α₄β₇ integrin and MAdCAM-1 to investigate factors associated with VDZ efficacy in patients with IBD and revealed a significant difference in genetic signatures at baseline between subjects with and without clinical remission at week 14²¹. Verstockt et al. employed machine-learning methods and reported that the expression of four genes in colon tissue could be predictive of VDZ efficacy in patients with IBD²². Gazouli et al. 
analyzed the mucosal expression of immunological and inflammatory genes using a machine-learning algorithm and demonstrated that the response to VDZ in patients with UC is associated with mucosal gene expression profiles at baseline²³. Although these findings are interesting, at present, they cannot be feasibly examined in a clinical setting. We believe it is advantageous to analyze common clinical features that can be obtained in a clinical setting to allow application of the predictors and prediction models in daily practice. Regarding point (3), adding experimental factors to the metadata for machine learning may provide opportunities to investigate novel factors associated with outcomes and understand the underlying pathological features of UC. Previous studies demonstrated that mucosal gene expression profiles are related to the treatment response of patients with UC^24,25. Kim et al. reported that mucosal eosinophilia is a predictor of VDZ efficacy in patients with IBD²⁶. These findings suggest the possibility that more factors that contribute to the clinical outcome have not been examined in daily practice. Analyzing various hypothetical predictors (e.g., cytokine levels, gene expression, histological characteristics) together with machine-learning approaches would provide insights into the contribution of each factor and facilitate the discovery of the characteristics of UC subgroups. In conclusion, with machine learning, we determined the contribution of clinical features at week 0 to the achievement of SFCR in patients who received VDZ for UC at week 22 and developed a prediction model. The predictive accuracy was confirmed in a separate cohort. The concept and findings in this study will promote personalized medicine in UC, and they could possibly be extrapolated to other medications and diseases.

Data availability

The data underlying this article will be shared by the corresponding author upon reasonable request.

Abbreviations

5-ASA: 5-Aminosalicylic acid
AZA: Azathioprine
BMI: Body mass index
BUN: Blood urea nitrogen
CD: Crohn’s disease
CRP: C-reactive protein
GMA: Granulocyte and monocyte apheresis
JAK: Janus kinase
Lympho: Lymphocyte
MAdCAM-1: Mucosal addressin cell adhesion molecule-1
MCH: Mean corpuscular hemoglobin
MES: Mayo endoscopic subscore
NPV: Negative predictive value
pMayo score: Partial Mayo score
PPV: Positive predictive value
PSL: Prednisolone
RF: Random forest
SFCR: Steroid-free clinical remission
TAC: Tacrolimus
TCho: Total cholesterol
TNF: Tumor necrosis factor
TOF: Tofacitinib
UC: Ulcerative colitis
UCEIS: Ulcerative colitis endoscopic index of severity
UST: Ustekinumab
VDZ: Vedolizumab

Acknowledgements

This work was supported in part by grants from the Japan Sciences Research Grant for Research on Intractable Diseases (Japanese Inflammatory Bowel Disease Research Group) affiliated with the Japan Ministry of Health, Labour and Welfare. We thank Dr. Mark W. Musch for his English proofreading. We thank Joe Barber Jr., PhD, from Edanz (https://www.edanz.com/ac) for editing a draft of this manuscript.

Author information

These authors contributed equally: Jun Miyoshi and Tsubasa Maeda.

Authors and Affiliations

Department of Gastroenterology and Hepatology, Kyorin University School of Medicine, 6-20-2 Shinkawa, Mitaka-shi, Tokyo, 181-8611, Japan
Jun Miyoshi, Daisuke Saito, Minoru Matsuura & Tadakazu Hisamatsu
Department of Electrical, Electronic and Computer Engineering, Faculty of Engineering, Gifu University, 1-1 Yanagido, Gifu-shi, Gifu, 501-1193, Japan
Tsubasa Maeda & Satoshi Tamura
Division of Gastroenterology and Hepatology, Department of Internal Medicine, Toho University Sakura Medical Center, 564-1 Simoshizu, Sakura-shi, Chiba, 285-0841, Japan
Katsuyoshi Matsuoka
Department of General Medicine, Kyorin University School of Medicine, 6-20-2 Shinkawa, Mitaka-shi, Tokyo, 181-8611, Japan
Sawako Miyoshi & Susumu Okamoto

Contributions

J.M., T.M., K.M., M.M., S.T., and T.H. conceived the study, designed experiments, and prepared the manuscript. J.M., T.M., D.S., S.M., and S.T. performed experiments and analyzed data. K.M., M.M., S.O., S.T., and T.H. supervised the manuscript. S.T. and T.H. oversaw the entire project.

Corresponding authors

Correspondence to Satoshi Tamura or Tadakazu Hisamatsu.

Ethics declarations

Competing interests

Jun Miyoshi has received lecture fees from JIMRO Co. and Takeda Pharmaceutical Co. Ltd. Katsuyoshi Matsuoka has served as a scientific adviser for EA Pharma; served on advisory boards for Bristol-Myers Squibb and Eli Lilly and Company; received speaker fees from Mitsubishi Tanabe Pharma, Takeda Pharmaceutical, Janssen, AbbVie, EA Pharma, Pfizer Inc, Mochida Pharmaceutical, Kyorin Pharmaceutical, Zeria Pharmaceutical, Kissei Pharmaceutical, and JIMRO; and received research grants from Mitsubishi Tanabe Pharma, Mochida Pharmaceutical, Kyorin Pharmaceutical, AbbVie, Takeda Pharmaceutical, Nippon Kayaku, EA Pharma, Kissei Pharmaceutical, and JIMRO Co. Minoru Matsuura has received consulting and lecture fees from Janssen Pharmaceutical K.K., Takeda Pharmaceutical Co. Ltd., AbbVie GK, Mitsubishi Tanabe Pharma Corporation, Kyorin Pharmaceutical Co. Ltd., Mochida Pharmaceutical Co., Ltd., JIMRO Co., Nippon Kayaku Co. Ltd., Mylan EPD G.K., and Aspen Japan Co. Ltd. Tadakazu Hisamatsu has performed joint research with Alfresa Pharma Co. Ltd. and EA Pharma Co. Ltd.; received grant support from Mitsubishi Tanabe Pharma Corporation, EA Pharma Co. Ltd., AbbVie GK, JIMRO Co. Ltd., Zeria Pharmaceutical Co. Ltd., Daiichi-Sankyo, Kyorin Pharmaceutical Co. Ltd., Nippon Kayaku Co. Ltd., Takeda Pharmaceutical Co. Ltd., Pfizer Inc., and Mochida Pharmaceutical Co., Ltd.; and received consulting and lecture fees from EA Pharma Co. Ltd., AbbVie GK, Celgene K.K., Janssen Pharmaceutical K.K., Pfizer Inc., Nichi-Iko Pharmaceutical Co., Ltd., Mitsubishi Tanabe Pharma Corporation, Kyorin Pharmaceutical Co. Ltd., JIMRO Co., Mochida Pharmaceutical Co., Ltd., and Takeda Pharmaceutical Co. Ltd. Tsubasa Maeda, Daisuke Saito, Sawako Miyoshi, Susumu Okamoto, and Satoshi Tamura have no conflict of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Table S1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Miyoshi, J., Maeda, T., Matsuoka, K. et al. Machine learning using clinical data at baseline predicts the efficacy of vedolizumab at week 22 in patients with ulcerative colitis. Sci Rep 11, 16440 (2021). https://doi.org/10.1038/s41598-021-96019-x

Received: 08 June 2021
Accepted: 04 August 2021
Published: 12 August 2021
DOI: https://doi.org/10.1038/s41598-021-96019-x
