Introduction

Intravenous fluid resuscitation is a widely employed therapy during the management of patients experiencing septic shock. The primary objective of this therapy is to replenish or enhance intravascular volume, which is often depleted or compromised in sepsis due to vasodilation of the vascular network1,2. This approach aims to improve both macrovascular perfusion, such as stroke volume and cardiac output, and microvascular perfusion, including capillary blood flow, thereby addressing organ hypoperfusion—a key aspect of sepsis pathophysiology that guides resuscitation efforts3. However, intravenous fluid resuscitation may lead to dilutional coagulopathy, fluid overload, and pathological edema in various organs, particularly the lungs4,5. Additionally, vasopressor agents are frequently utilized to address hypoperfusion by inducing constriction of arterioles and venules while enhancing cardiac contractility. Nonetheless, vasopressor therapy carries inherent risks, such as vasoconstriction-associated tissue ischemia, elevated cardiac workload, and potential arrhythmias. Clinicians have historically combined these two therapeutic modalities to deliver supportive care to individuals with sepsis-related hypoperfusion. However, there remains a scarcity of data guiding the precise administration of intravenous fluids or vasopressors in the management of septic shock. Current resuscitation practices in patients with septic shock exhibit variability, ranging from restricted volume and early vasopressor administration to liberal fluid administration and delayed vasopressor initiation6.

Numerous clinical trials have been conducted to investigate the efficacy of liberal vs restrictive fluid strategies, employing diverse fluid administration protocols7,8,9,10,11,12,13. However, these studies have not consistently demonstrated the superiority of either strategy14. Septic shock encompasses a heterogeneous patient population, suggesting that distinct subgroups may exhibit disparate responses to a given fluid strategy1,15. Therefore, precise management of septic shock holds promise for uncovering insights into fluid strategies. A multi-omics study involves integrating and analyzing data from various omics disciplines, including genomics, transcriptomics, proteomics, metabolomics, and epigenomics. By amalgamating information from multiple omics layers, researchers can achieve a more comprehensive understanding of biological systems and processes of sepsis16,17,18. Fluid administration can impact various biological systems, such as coagulation, endothelial function, electrolyte balance, and inflammatory responses. Thus, employing systems biology approaches to identify subgroups of septic shock patients may offer new insights into the effectiveness of fluid strategies. In this work, we identify septic shock patient subgroups with differential responses to fluid strategies and develop a proteomic signature for predicting group membership. Ultimately, this research lays the foundation for precise fluid management in patients with septic shock.

Results

Study population

The study included a total of 494 patients diagnosed with septic shock, contributing to 1285 person-days, with 352 patients in the training cohort (Table 1). The validation cohort, consisting of 142 patients, was sourced from independent hospitals distinct from those contributing to the training set (Fig. 1). The treatment provided was consistent with the usual care at the respective hospitals, and no specific intervention was introduced by the study protocol. Both cohorts exhibited similar baseline characteristics, encompassing age, sex, height, weight, sequential organ failure assessment (SOFA) scores, mortality rates, and length of hospital stays. However, differences were noted in terms of infection site and serum lactate levels between the two cohorts. In the training cohort, 255 patients received a liberal fluid strategy, while 97 received a restrictive fluid strategy on day 1 (Supplementary Table 1). Patients assigned to the restrictive strategy group presented with more severe illness (SOFA score = 10 [8, 12] vs 8 [6.5, 11], p < 0.001), a higher mortality rate (24% vs 16%), and increased vasopressor dosages. Additionally, patients in the restrictive strategy group had significantly lower fluid intake (2110 [1556.7, 2569] ml vs 3448.1 [2504, 4820.8] ml/day, p < 0.001) and reduced urine output (815 [285, 1587.5] vs 1100 [505, 2373.75] ml/day; p = 0.007).

Table 1 Baseline characteristics of subjects in the training and validation cohorts
Fig. 1: An overview of the analysis workflow of the study.
figure 1

The workflow involved four steps construction of a multi-omics dataset, subgroup identification, model validation in an independent cohort, and development of a proteomic signature. LASSO least absolute shrinkage and selection operator, ROC receiver operating characteristics, AUC area under curve, RNA-seq ribonucleic acid sequencing.

Subgroup identification

In our study, we comprehensively quantified 61,792 genes across all samples before applying our filtering criteria, including both protein-coding and non-protein-coding genes. For a detailed description of our methodology, including library preparation, sequencing specifications, alignment and quantification techniques, and filtering criteria, please refer to the supplemental methods provided. Genes with a total expression count of less than 100 across all samples were filtered out. This criterion was chosen to ensure that we focus our analysis on genes with sufficient overall expression to provide reliable statistical power, while minimizing the inclusion of genes with very low expression, which might introduce noise and reduce the robustness of our findings. There was overlap of the propensity score between the liberal and restrictive groups (Supplementary Fig. 1). Utilizing LASSO with cross-validation criterion, 13 genes were identified to interact with treatment effects, namely DEFA8P, ENSG00000255641, ENSG00000262304, ENSG00000264063, ENSG00000284292, ENSG00000285238, ENSG00000289707, FAM220A, FARP1, HLA-J, HNRNPUL2-BSCL2, IFNG-AS1, KLRC4, LY75-CD302, and PAQR9 (Supplementary Figs. 2 and 3). Analysis of the training dataset revealed that patients with congruent recommended and administered fluid strategies experienced significantly improved survival outcomes compared to those with disparate strategies (Supplementary Table 2 and Fig. 2). In particular, the Liberal_Restrict group had the worst survival outcome (Fig. 2 and Table 2). The treatment effects, conditioned on recommended subgroups, yielded mean differences in survival days of 59.7 days (estimate of \(E\left[{Y|T}={liberal},{recom}={liberal}\right]-E\left[{Y|T}={restrictive},{recom}={liberal}\right]\)) and 41.7 days (estimate of \(E\left[{Y|T}={restrictive},{recom}={erstrictive}\right]-E\left[{Y|T}={liberal},{recom}={restrictive}\right]\)) for liberal and restrictive fluid strategies, respectively. The benefit score spanned from −1.75 to 1.21, with higher values indicative of a more favorable outcome with the restrictive strategy. In the validation cohort, these metrics were 48.1 and 19.4 days for groups recommending liberal and restrictive fluid strategies, respectively (Supplementary Fig. 4). On average, there was an increase of 19.9 days in time-to-event if all patients adhered to the fluid strategy recommended by the benefit score. This effect corresponds to a hazard ratio of 0.82 (95% confidence interval (CI): 0.64–0.92). When applied to the baseline reference hospital mortality rate of 16%, adherence to the recommended fluid strategy could potentially lower this rate to 13%.

Fig. 2: Survival curves stratified by the recommended and received subgroups, with the shaded area indicating the 95% CI.
figure 2

Groups are denoted as “Recmd_Actual,” where, for instance, “Liberal_Restrict” signifies a recommendation for a liberal fluid strategy but actual receipt of a restrictive one. Censored patients are denoted by small “+“ symbols. Data are presented as mean values and a 95% CI.

Table 2 Cox proportional model incorporating time-varying covariates

Supplementary Table 3 presents the clinical disparities between recommended liberal and restrictive fluid strategy groups. Figure 3a illustrates the nuanced clinical differences among groups that recommended different fluid strategies. It is observed that patients predicted to benefit from restrictive fluid resuscitation predominantly presented with more severe pulmonary and renal injuries, indicated by higher FiO2 requirements. Despite minimal clinical feature space separation between the restrictive and liberal groups, as depicted in Fig. 3b, a pronounced distinction was noted in the transcriptomic space, as shown in Fig. 3c.

Fig. 3: Comparison of clinical and transcriptomic features between patients who may benefit from restrictive fluid strategy and those who may benefit from liberal strategy.
figure 3

a The two groups, liberal (n = 260) and restrictive (n = 92), were compared for their differences in clinical features. Analysis of variance (ANOVA) was employed to assess the statistical differences in each variable represented on the x-axis across the four groups. Significance levels are indicated with asterisks as follows: *p < 0.05, **p < 0.01, and ****p < 0.001. Data are presented as mean values ± SEM, and it is noted that all variables have been normalized for comparison purposes. b A plot matrix is presented to illustrate the distribution of samples within the clinical feature space, as defined by the first three principal components. c Similarly, another plot matrix shows the distribution of samples within the transcriptomic feature space, also based on the first three principal components. urine Urine volume; plt platelet count, pcvo central venous oxygen saturation, sapmin minimum arterial lactate, cl creatinine level, tmin minimum body temperature, scvo mixed venous oxygen saturation, wbc white blood cell count, paco arterial carbon dioxide pressure, inr international normalized ratio, rrmax maximum respiratory rate, alb albumin, k potassium, mapmax maximum mean arterial pressure, ddimer D-dimer, hct Hematocrit, mapmin minimum mean arterial pressure, gcs Glasgow coma scale, pao arterial oxygen pressure, aptt activated partial thromboplastin time, lac lactic acid, SaO2 arterial oxygen saturation, procal procalcitonin, phcv pH of central venous blood, na sodium, ca calcium, hrmax maximum heart rate, abe base excess, tt thrombin time, tmax maximum body temperature, crp C-reactive protein, sapmax maximum arterial lactate, age age, hrmin minimum heart rate, rrmin minimum respiratory rate, bilirubin bilirubin, height height, cr creatinine clearance, pha pH value of arterial blood, pcvco carbon dioxide of central venous blood, bun blood urea nitrogen, fio2 fractional inspired oxygen, SOFA sequential organ failure assessment score, scaled_value scaled value.

Given the importance of reconciling the gap between recommended and actual treatment, we categorized patients into four distinct groups based on the fluid strategies that were recommended and those that were ultimately administered. Despite no significant differences in commonly recorded clinical features, suggesting limitations in using routine variables for fluid strategy prescription, further comparison of clinical variables revealed significantly higher urine output in the liberal-liberal group and elevated SOFA score in the restrictive-restrictive group (Supplementary Fig. 5). While the recommended restrictive and liberal groups exhibited minimal separation in the clinical feature space (Supplementary Fig. 6), greater distinctiveness was observed in the transcriptomic space (Supplementary Fig. 7). Over the course of sepsis from day 1 to 5, a trend towards recommending restrictive fluid strategy emerged (Supplementary Fig. 8). In our study, we observed distinct gene expression profiles between patients who benefited from restrictive vs liberal fluid strategies. For instance, higher expressions of CLDN8, TPO, DIO2, and IYD, were associated with better outcomes in the restrictive fluid group. Claudin proteins are components of tight junctions in epithelial and endothelial cells. CLDN8 specifically has been implicated in the regulation of vascular permeability19,20, making it a candidate gene that could be associated with vascular endothelial dysfunction in sepsis. DIO2 and IYD are involved in the activation of thyroid hormones. Thyroid hormones can influence the cardiovascular system, including endothelial function21,22. Figure 4 offers a sophisticated molecular dissection of the responses to restrictive and liberal fluid strategies in sepsis. The differential gene expression analysis (Fig. 4a) elucidates the molecular divergence between patients potentially benefiting from restrictive vs liberal fluid strategies. The enrichment analysis reveals a notable modulation of biological processes related to sensory perception and neurotransmission (Fig. 4b). This indicates that fluid strategy may exert neuromodulatory effects, influencing pathways that could be implicated in the neuroinflammatory sequelae of sepsis, including cognitive impairment and pain perception alterations. Further examination of metabolic pathways uncovers perturbations in fundamental metabolic processes, including glycolysis and cholesterol biosynthesis (Fig. 4c). These metabolic disturbances may reflect the body’s adaptive or maladaptive efforts to maintain energy homeostasis under the influence of different fluid management approaches. Figure 4d provides a deeper exploration of the immunological and hematological responses. The observed enrichment of pathways related to platelet activation and immunoglobulin production suggests a differential impact on hemostasis and the humoral immune response.

Fig. 4: Transcriptomic profiles of the recommended restrictive vs liberal fluid strategy groups.
figure 4

a A volcano plot illustrates differentially expressed genes between restrictive and liberal fluid strategy groups. Positive values denote higher expression in the restrictive group than the liberal group. p-values were adjusted using the Benjamini–Hochberg procedure. b The dot plot displays gene ontology (GO) enrichment pathways for the differentially expressed genes. Adjusted two-sided p-values are indicated by color, while circle size represents gene count. c The dot plot exhibits Molecular Knowledge-based Kyoto Encyclopedia of Genes and Genomes (MKEGG) enrichment pathways for the differentially expressed genes. Adjusted two-sided p-values are depicted by color, with circle size indicating gene count. d A tree plot presents GO enrichment pathways for differentially expressed genes using gene set enrichment analysis (GSEA). Adjusted two-sided p-values are visualized by color, while circle size reflects gene count. MKEGG refers to the “molecular knowledge-based” pathway database provided by the Kyoto Encyclopedia of Genes and Genomes (KEGG). This database contains curated information on molecular interactions, reactions, and pathways in various biological systems, including metabolic pathways, signaling pathways, and diseases. In the figure, Ensembl IDs and gene symbols are used. Some genes do not have official gene symbols because they were recently discovered or annotated and might not have been evaluated and approved. Some genomic elements, such as pseudogenes or certain types of non-coding RNA, may not be given symbols because they do not encode proteins or have known regulatory functions. For instance, the Ensembl ID ENSG00000264063 corresponds to an RNA gene that is specifically categorized under the class of microRNAs (miRNAs). As a non-coding RNA, it has not been assigned a conventional gene symbol.

Association of sepsis endotypes with the fluid resuscitation benefit score

Sepsis endotypes were discerned through the application of the Partitioning Around Medoids (PAM) method, which evaluated gene expression profiles. The analysis revealed two distinct sepsis endotypes. Cluster 1 was distinguished by an exaggerated hyperinflammatory profile, whereas Cluster 2 was marked by a relative downregulation of pro-inflammatory cytokines (Fig. 5a), corroborating findings from existing literature18,23,24,25. The clustering outcomes were then integrated with the previously determined benefit scores, demonstrating that Cluster 2 patients potentially responded better to a restrictive fluid strategy (Fig. 5b). A significant differential expression was observed between the endotypes for a multitude of genes, encompassing both protein-coding and non-coding RNAs (Fig. 5c).

Fig. 5: Identification of two sepsis endotypes in the transcriptomic space.
figure 5

a This panel provides a visualization of the sepsis endotypes in the space defined by the first two principal components. The clusters are differentiated by color and point shape, indicating the distinct transcriptomic profiles of each endotype. b The distribution of the benefit score within the PCA space is depicted, with the red color signifying patients who are more likely to benefit from a restrictive fluid strategy, and the blue color indicating those who may benefit from a liberal fluid strategy. c A volcano plot is presented to illustrate the differentially expressed genes between the two sepsis endotypes. Genes are identified using their Ensembl IDs and gene symbols. It is noted that some genes, particularly those recently discovered or annotated, may not have official gene symbols. Additionally, certain genomic elements such as pseudogenes or specific types of non-coding RNA, which do not encode proteins or have established regulatory functions, may lack symbols. Adjusted two-sided p-values were reported. d A tree plot is utilized to present GO enrichment pathways for the differentially expressed genes, as determined by GSEA. The adjusted two-sided p-values are represented by color intensity, with the circle size corresponding to the count of genes within each pathway.

Functional enrichment analysis of these genes indicated their involvement in various immunological pathways, including leukocyte-mediated immunity, adaptive immune responses, immunoglobulin-mediated immune reactions, and T-cell activation (Fig. 5d). These biological pathways are pivotal in the immunopathogenesis of sepsis and may offer insights into the heterogeneity of treatment responses among different sepsis endotypes.

Proteomic signature for recommending the fluid strategy

We developed a proteomic signature to leverage the stability and direct biological relevance of proteins in the clinical setting, despite the close association of transcriptomic data with clinical outcomes. The proteomic approach facilitates rapid measurement with standard clinical assays, supporting practical application in sepsis management26. We evaluated 363 plasma proteins and conducted PCA to explore the variability and confounders (Fig. 6). The PCA revealed a moderate distinction between survivors and non-survivors (Fig. 6a), no significant batch effects across hospital samples (Fig. 6b), and patients with elevated SOFA scores were clustered in the upper-left quadrant of the plot (Fig. 6c).

Fig. 6: Principal component analysis (PCA) biplot of protein expression profiles by clinical, biological, and technical variables.
figure 6

These dimension reduction plots present the results of a PCA on protein expression data, illustrating the variance in protein expression across different clinical, biological, and technical variables. The first two principal components, Dim1 and Dim2, account for 25.9% and 11.8% of the total variance, respectively (a). PCA Biplot colored by mortality status, highlighting the separation between Survivors (Y) and non-survivors (N). The projection of individual data points shows the distribution of samples along the principal components, with color intensity indicating the presence of mortality (b). PCA Biplot differentiated by site/hospital. Each site/hospital is represented by a distinct symbol, demonstrating the clustering and dispersion of samples based on their protein expression profiles (c). PCA Biplot colored by the SOFA score, a measure of organ dysfunction. The gradient from light to dark represents increasing SOFA scores, reflecting the severity of organ failure in the samples (d). Close-up view of the PCA biplot focusing on the individual variables (proteins) GAL, NFASC, MICB_MICA, MAP2K6, JCHAIN, and CSF3. The arrows represent the direction and magnitude of the contribution of each protein to the principal components. The angle and length of these arrows indicate the correlation of each protein with the principal components. The color gradient from blue to red on the visualization indicates escalating benefit scores, with blue representing lower scores and red indicating higher, positive values that suggest a beneficial response to restrictive fluid strategies. CSF3 colony-stimulating factor 3, GAL galanin, NFASC neurofascin, MICB_MICA MHC class I polypeptide-related sequence B and MICA, MAP2K6 mitogen-activated protein kinase kinase 6, JCHAIN joining chain of multimeric IgA and IgM, SOFA sequential organ failure assessment, PCA principal component analysis, Dim1 dimension 1, Dim2 dimension 2, N no, Y yes, BinZhou, ChangDe, GYEY, JinHua, PiDu, SSR, XiaMen, XiAn2fy, XMZSH, ZiGong, ZZCH site or hospital identifiers.

Differential expression analysis revealed significant differences between the restrictive and liberal fluid strategy groups, highlighting CSF3, MICB_MICA, and CXCL9 (Fig. 7a). Gene set enrichment analysis (GSEA) indicated enrichment in pathways related to bacterial response, lipid cellular response, and steroid hormone response (Supplementary Fig. 9). Important biological pathways such as interleukin-10 signaling, regulation of hormone secretion, and response to oxygen-containing compounds were identified (Supplementary Fig. 10). Using a LASSO regression model against the benefit score, a proteomic signature comprising GAL, NFASC, MICB_MICA, MAP2K6, JCHAIN, and CSF3 was derived (Table 3, Supplementary Fig. 11, and Fig. 7b). The six proteins, highlighted in Supplementary Table E4 and integral to our predictive nomogram, play pivotal roles in orchestrating immune responses and endothelial damage27,28. Despite the current assays requiring 3–5 h to yield results, there is a pressing requirement for their optimization to expedite this process. Streamlining the assay timeline is crucial for the ICU setting, where the speed of clinical decision-making can influence patient outcomes. The performance of this model in recommending restrictive vs liberal fluid strategy yielded an area under the curve (AUC) of 0.802 (95% CI: 0.752 to 0.846; Fig. 7c), which was comparable to that developed by transcriptomic data (AUC = 0.845, 0.797–0.884). It is also noted that other machine learning models do not show significant improvement over the logistic regression model in classifying subgroups of patients who may benefit from a restrictive fluid strategy (Fig. 7c).

Fig. 7: Development of a protein signature model for recommending liberal or restrictive fluid strategy.
figure 7

a Comparative analysis of plasma protein expression levels between the restrictive (n = 102 person-days) and liberal (n = 311 person-days) fluid strategy cohorts. Two-sided p-values were adjusted using the Benjamini–Hochberg procedure. b Nomogram demonstrating the estimation of benefit scores based on plasma protein levels. The red lines represent a sample patient, with a score of −0.42, suggesting potential benefit from the liberal strategy. Two-sided p-values are reported for each variable; Significance levels are indicated with asterisks as follows: *p < 0.05, **p < 0.01, and ***p < 0.005. c ROC plot illustrating the diagnostic accuracy of the protein signature for identifying patients who may benefit from the restrictive fluid strategy.

Table 3 Protein markers in the LASSO model for prediction of benefit score

Discussion

The study devised a benefit score based on transcriptomic data to guide fluid management in septic shock patients. This score ranged from −2 to 2, with higher values indicating potential benefits from a restrictive fluid strategy. The restrictive approach involved administering less fluid and early vasopressor use to maintain target blood pressure. Validation was conducted in an independent, non-overlapping cohort recruited from different hospitals during the same period as the training cohort. Due to the technical challenges associated with measuring RNA expression levels, a proteomic signature was further developed to facilitate clinical translation. Six proteins were identified from a pool of 363 plasma proteins. The proteomic signature demonstrated moderate performance in differentiating patients who might benefit from a restrictive fluid strategy. Additionally, a nomogram was devised to aid in the clinical application of the findings. Our study’s novelty lies in its comprehensive multi-omic approach, offering a unique perspective on sepsis heterogeneity. By identifying a distinct proteomic signature associated with differential responses to fluid resuscitation, our work contributes to the advancement of personalized sepsis management. This tailored approach has the potential to optimize treatment strategies, thereby enhancing clinical outcomes and addressing the unmet need for precision in septic shock therapy.

Numerous studies have investigated fluid management strategies in patients with septic shock or sepsis-induced hypotension9,12,29. Despite variations in protocols, these studies have not demonstrated the clear benefits of either restrictive or liberal fluid strategies29. We hypothesize that sepsis encompasses a heterogeneous population, with certain subgroups potentially benefiting from restrictive fluid strategies while others do not23,30,31,32. Previous attempts to identify patient subgroups using unsupervised machine learning algorithms have been limited by their inability to incorporate interventions, thus hindering their ability to guide treatment decisions32. In our study, we introduced a benefit score designed to guide fluid management strategies. Unlike traditional approaches such as outcome modeling and unsupervised clustering, our method prioritizes modeling treatment-covariate interactions over main covariate effects. This emphasis stems from the fact that subgroup identification hinges solely on interaction signs33,34. The benefit score, introduced in our study, quantifies the individualized treatment effect by capturing the magnitude of interaction. This score facilitates the recommendation of restrictive fluid strategies using varied cutoff values tailored to real clinical scenarios. By integrating interventions into causal inference models, we aim to comprehensively explore the interplay between treatment and covariates driving heterogeneity of main effects. Distinct gene expression patterns were observed among patients differentially responding to restrictive fluid strategies, with implicated genes participating in critical biological processes such as cell activation, membrane potential regulation, platelet activation, complement cascade, and cell–cell adhesion. In sepsis, the activation of immune cells, predominantly leukocytes, triggers the release of inflammatory mediators that impair endothelial function, leading to enhanced vascular permeability through the disruption of tight junctions and the glycocalyx layer. Upregulated expression of cell–cell adhesion molecules, including cadherins and selectins, facilitates leukocyte binding to the endothelium, further compromising the glycocalyx and augmenting vascular permeability35,36. The complement system, an integral component of the innate immune response, is also activated in sepsis. Its activation results in the production of factors that enhance endothelial expression of adhesion molecules, thereby increasing vascular permeability37. Complement-mediated cleavage of glycocalyx components exacerbates barrier dysfunction38. The aforementioned biological processes are closely associated with endothelial function and vascular barrier integrity. Disruptions therein can precipitate heightened vascular permeability and glycocalyx injury, which are pivotal in the pathogenesis of sepsis and its clinical sequelae. These findings underscore the complex interplay between gene expression, endothelial biology, and the hemodynamic perturbations observed in septic patients.

Genes chosen to construct the benefit scores encompass both protein-coding and non-coding genes, with regulatory mechanisms too intricate for our study’s scope. Given that proteins serve as the ultimate effectors, utilizing plasma protein levels may offer greater biological relevance for recommending fluid strategies. The proteins comprising our proteomic signature are biologically explicable. For instance, CSF3 regulates the production, differentiation, and function of granulocytes, innate immune cells crucial for immune response39,40. This protein is commonly used to manage chemotherapy-induced neutropenia. Granulocytes also release vasodilatory mediators contributing to septic shock, wherein a liberal fluid strategy aids in restoring effective circulatory volume. Hence, higher CSF3 values are associated with increased benefits from a liberal fluid strategy.

Several study limitations warrant acknowledgment. Firstly, A critical consideration in our study design was the categorization of fluid strategies. While this approach has the advantage of providing clear categorization for analysis, it also introduces potential limitations. The choice of the 3000 ml threshold and the SOFA_cv score cutoff are based on clinical judgment and existing literature, which may not be universally applicable across all patient populations or settings. Furthermore, the categorization may not fully capture the complexity and individual variability in clinical decision-making regarding fluid management. The potential for variability in fluid strategy application could introduce bias or confounding in our analysis. Additionally, the observational nature of our study and reliance on recorded data means that we cannot account for all the nuances and rationale behind fluid management decisions made by treating physicians. Future studies with a prospective design, standardized fluid management protocols, and a larger and more diverse patient population are needed to validate and refine our approach. Secondly, the relative protein values utilized in the study necessitate batch normalization, highlighting the need for validation of the proteomic signature’s accuracy through ELISA-tested plasma protein concentrations. We also acknowledge that the current assays for GAL, NFASC, MAP2K6, and others do not provide results within the immediate timeframes (<1 h) required for ICU resuscitation. This limitation underscores the need for the development of rapid diagnostic tools, which is a critical area for future research. In the current study, one limitation is the absence of data regarding the specific types of fluids used for resuscitation, be it balanced or unbalanced. The choice of fluid can potentially influence clinical outcomes, particularly in the context of sepsis, where fluid management is a critical component of treatment41,42. While our analysis did not account for this variable, we acknowledge that the type of fluid may play a pivotal role in patient responses and survival, especially considering the evolving understanding of sepsis subphenotypes and their tailored management strategies43. While the findings of this study provide valuable insights into individualized fluid strategy, it is important to recognize that our results are primarily applicable to patients whose data closely resemble those in our training set. The generalizability of these results to other populations or settings may be limited due to differences in patient demographics, comorbidities, and clinical practices. Future studies are needed to validate the model’s performance in diverse populations and to explore its applicability in different clinical scenarios.

To effectively translate our findings into clinical practice, several pivotal steps must be undertaken. Firstly, we must subject our model to prospective validation to ensure its predictive accuracy in diverse patient populations. Secondly, the development of rapid and sensitive assays for the identified proteomic signature is essential for practical clinical application. Lastly, a cost-effectiveness analysis will assess the economic viability of our approach, considering both direct costs and potential savings from improved patient outcomes.

In conclusion, this study presents a proteomic signature with potential applications in guiding fluid strategy for patients with septic shock. However, further validation studies are necessary to establish the causal relationship between fluid strategy and survival outcomes.

Methods

Study setting and patient enrollment

The study received approval from the ethics committee of Sir Run Run Shaw Hospital (approval number: 20201014-39), and written informed consent was obtained from patients or their next-of-kin surrogates. Participants received non-monetary compensation in the form of access to their personal data, providing potential insights into their health and personalized recommendations. Conducted within the Chinese Multi-omics Advances In Sepsis (CMAISE) consortium between November 2020 and December 2023, this study involved 35 Chinese hospitals44. The study protocol was registered with the Chinese Clinical Trial Registry (http://www.chictr.org.cn/; ChiCTR2000040446)45. Patient eligibility was based on meeting the Sepsis-3.0 criteria (suspected or documented infection plus acute increase in SOFA score > 2 points) upon admission to the intensive care unit (ICU)46. Septic shock was defined as those with hypotension (mean arterial pressure <65 mmHg) requiring vasopressors despite adequate fluid resuscitation. Exclusion criteria included: (1) end-stage cirrhosis with Child‒Pugh C; (2) concomitant malignancy or autoimmune disease; (3) do-not-resuscitate order; (4) pregnancy; (5) sepsis onset > 48 h or treatment at other hospitals when presenting to CMAISE member hospitals; (6) immunosuppression, such as long-term use of immunosuppressive agents, chemotherapy, corticosteroids, radiotherapy, or HIV infection; and (7) acute myocardial infarction and/or pulmonary embolism. The inclusion criteria were designed to ensure a representative sample across various demographic factors, including sex and age.

Blood sample preparation and testing

Blood samples were collected on days 1, 3, and 5, with peripheral whole blood samples obtained using PAXgene tubes and subsequently stored at −80 °C. These samples underwent library preparation and gene expression quantification by LC-Bio Technologies (Hangzhou) Co., LTD. The DESeq2 pipeline was utilized for differential gene expression analysis, with genes having less than 100 counts in all samples being excluded47. A variance stabilizing transformation (VST) was applied to normalize the count data, resulting in a homoscedastic matrix of values across the range of mean values. Batch effects originating from different institutions were mitigated using a design matrix incorporating sample source information. Differential gene expression between restrictive vs liberal strategy groups was depicted using volcano plots. Additionally, GSEA utilizing gene ontology (GO) terms was performed with the clusterProfiler package (version 4.12.0)48, to assist in deciphering the biological significance of the differentially expressed genes. This approach is crucial for understanding the broader functional implications of the observed gene expression changes within our study49,50,51.

The Olink® inflammation panel (Olink Proteomics AB, Uppsala, Sweden) was utilized for protein quantification in accordance with the manufacturer’s instructions. The protocol relies on the proximity extension assay (PEA) technology, wherein pairs of oligonucleotide-labeled antibody probes selectively bind to target proteins52. Upon close proximity, these probe pairs undergo pair-wise hybridization of the associated oligonucleotides. The subsequent introduction of a DNA polymerase initiates proximity-dependent DNA polymerization, resulting in the formation of a distinct PCR target sequence. This DNA sequence is then detected and quantified using a cutting-edge microfluidic real-time PCR instrument (Signature Q100, LC-Bio Technology CO., Ltd., Hangzhou, China). Subsequent analyses, including differential and enrichment analyses, were carried out following established protocols. Specifically, gene set enrichment analysis (GSEA)48 was conducted using the Molecular Signatures Database (MSigDb)53,54 and executed with the clusterProfiler R package (version 4.12.0).

Important clinical variables and definitions

Upon admission, demographic information such as age, sex (based on hospital records), height, and weight were recorded as described previously. Laboratory parameters, including C-reactive protein, serum creatinine, urine output, procalcitonin, and coagulation profiles, were evaluated on days 1, 3, and 5. Fluid intake was documented on days 1, 3, and 5.

In our study, patients with septic shock were categorized into two groups based on their fluid management strategy: restrictive and liberal. The ‘restrictive fluid strategy’ was defined by two criteria: a SOFA cardiovascular score (SOFA_cv) greater than 2 points, indicating a priority for vasopressor use, and a daily fluid intake of less than 3000 ml. Conversely, a ‘liberal fluid strategy’ was characterized by either a SOFA_cv score of 2 points or less, or a daily fluid intake of 3000 ml or more.

The selection of the 3000 ml threshold for daily fluid intake was informed by a review of existing literature7,11,55,56, and aimed to ensure a sufficient number of patients in each group for robust statistical analysis and to allow for the identification of distinct subgroups within the septic shock population that might respond differently to fluid management strategies. The fluid strategy was assessed on days 1, 3, and 5 of the study period, capturing the initial and critical phase of septic shock treatment known to impact patient outcomes.

Identification of subgroups showing differential responses to the fluid strategy

The analysis piple included four steps: (1) create a propensity score function and assess the diagnostics of the propensity scores. (2) Select and train a model for subgroup identification. (3) Compute the treatment effects within the identified subgroups. (4) Visualize and analyze the treatment effects of the model and subgroups57. Essentially, we want to see if treatment affects patients’ outcomes differently. We consider patient characteristics (whole blood transcriptome), treatment status (liberal vs restrictive strategy), and the outcome of interest (survival outcome). The main goal is to estimate a “benefit score” for each patient, which tells us how much they’re expected to benefit from treatment. If the score is positive, the treatment is likely beneficial for them; if it’s negative or zero, the control might be better. This helps us identify which patients benefit from treatment58.

Validation of the model was conducted in a separate cohort that did not overlap with the original one. The validation data was withheld during model development and the analysts were blinded to the validation dataset throughout the model building process. The validation of our model was performed independently by a statistician who was not involved in the training of the model. In this new cohort, we computed a “benefit score” for each patient, where higher scores suggested a higher likelihood of recommending a restrictive fluid strategy. The objective was to determine whether patients with congruent recommended and administered fluid strategies exhibited superior survival outcomes compared to those with disparate strategies. Given that patients were followed and had blood samples collected for transcriptome and proteomics analysis on days 1, 3, and 5, we recognized the need to account for changes in patient status and biomarker levels over time. The time-varying nature of these covariates necessitated the use of a Cox model that could accommodate such changes59,60. This approach allows for the incorporation of time-dependent covariates, reflecting the dynamic nature of the clinical course in sepsis patients.

Proteomic signature to facilitate clinical translation

To enhance clinical translation, we focused on developing a proteomic signature. Plasma proteins offer greater stability compared to transcriptome data and exhibit strong correlations with clinical chemistry parameters61. Leveraging this connection, we aimed to create a signature capable of identifying patient subgroups benefiting from either liberal or restrictive fluid strategies. Using the Least Absolute Shrinkage and Selection Operator (LASSO)62, we filtered significant proteins from a pool of 363 plasma proteins. Subsequently, we constructed a nomogram for a simplified clinical application of the subgroup prediction model (Fig. 1). The efficacy of the model in terms of predictive accuracy was determined by calculating the area under the receiver operating characteristic (ROC) curve. To ascertain the precision of this estimate, a non-parametric bootstrap technique was employed for calculating the 95% CI for the AUC. This approach, known for its robustness, involves resampling the data with replacement to generate an empirical distribution of AUCs from which the CI is derived. To ensure robustness, prevent overfitting, and enhance the generalizability of the findings, multiple models were evaluated, including support vector machine (SVM), random forest, neural networks, and gradient boosting. This evaluation aimed to determine whether these models offer significant improvement over the logistic regression model. Default settings in the parsnip package (v1.2.1) were employed for training these models.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.