Introduction

Mycoplasma pneumoniae pneumonia (MPP) is a common childhood respiratory disease caused by Mycoplasma pneumoniae (MP) infection1. According to a 10-year study on respiratory tract surveillance in China, MP is the atypical pathogen with the highest detection rate in children aged 5 to 7 years2. Although MPP is generally considered to be a self-limiting disease and most children have mild conditions with a good prognosis3, in some special cases, it can also progress into severe MPP (SMPP) or refractory MPP (RMPP) complicated by serious intrapulmonary and extrapulmonary complications4. Intrapulmonary complications include pleural effusion, atelectasis, necrotizing pneumonia (NP), and pulmonary embolism, etc. In severe cases, respiratory failure and hypoxemia may occur, which can be fatal5. In addition to causing lung damage, MPP can lead to a range of extrapulmonary complications, including myocardial injury, abnormal liver function, kidney damage, anemia, central nervous system encephalitis, and thrombosis6,7.

In recent years, the incidence of SMPP has gradually increased, rising from 0.7% in 2006 to 35% in 20168. The specificity of early lung manifestations in children with SMPP is poor, making it difficult to distinguish from lung injury caused by other pathogens. Despite the growing availability of molecular techniques for pathogen detection, including quantitative and qualitative pathogen detection, laboratory results are usually only available after treatment decisions have been made. Moreover, with the widespread application of macrolides in children, drug-resistant MP strains have become increasingly common. In China, more than 85% of MP strains among pediatric patients have been reported as macrolide-resistant9. Consequently, precise prediction and identification of SMPP in clinical practice are crucial to reduce mortality and complications and to improve the prognosis of SMPP. At present, few epidemiological and comparative studies focus on the differences among viral pneumonia, bacterial pneumonia, and MPP, which may result in the abuse of antibiotics and increased microbial resistance to antibiotics10.

There are no well-defined and unified diagnostic criteria for pediatric SMPP. MPP patients with extensive lung lesions spreading over more than two thirds of the chest area in radiographic image, or those developing intra‐and extra‐pulmonary complications, can all be considered as SMPP. Clearly, SMPP has been recognized to present diverse clinical phenotypes, including pulmonary and non-pulmonary complication subtypes. Extrapulmonary complications can occur at any time following the onset of MP infections, even in asymptomatic cases, affecting up to 25% of those infected with MP11. A study shows that cardiovascular disease accounts for approximately 27.91% of extrapulmonary complications in MPP patients12. Factors leading to myocardial damage are significant for the early diagnosis and prevention of cardiac complications. Additionally, reports indicate that MP-related hepatitis occurs in approximately 10–30% of cases, mostly in children. This condition can manifest as asymptomatic elevation of liver enzymes, inhibition of multiple coagulation factors, or cholestasis13,14,15. Therefore, ignoring MP-related hepatitis may delay diagnosis and lead to serious liver damage during MPP. Therefore, establishing an accurate and reliable predictive model is crucial for preventing the occurrence of related organ damage in the early of MPP.

Artificial intelligence (AI) and machine learning (ML) are increasingly recognized as powerful tools for handling complex medical tasks. Machine learning algorithms excel at exploring intricate relationships within multidimensional data, extracting hidden and valid knowledge from vast datasets, and making more accurate predictions and diagnoses of diseases16. Historically, numerous studies have developed MPP prediction models focusing on diagnosis, severity, risk factors, treatment, and prognosis17,18,19,20. These studies primarily relied on constructing nomogram models for the early identification and intervention of MPP. Nomogram is a traditional calculation tool that includes variables and corresponding scoring lines. However, complex ML methods can manage a broader array of variables, often yielding more accurate and precise results than traditional modeling methods21. Additionally, one of the main challenges in applying MPP prediction models to clinical practice is the lack of external validation19,20,21,22.

Consequently, this study aims to use machine learning algorithms, based on the first laboratory test results of SMPP patients after their visit, to construct and validate an early warning model that could be applied in early identification and intervention of SMPP, and the prevention of disease progression. Secondly, it aims to develop a risk prediction model for liver and heart damage in SMPP to improve the prognosis and outcomes of SMPP patients. The performance of the models is validated using an external validation cohort from another hospital.

Methods

Study design and study population cohorts

This multicenter observational retrospective study was conducted at two hospitals. Cohort 1 (model 1 construction and internal validation): Patients with SMPP, viral and bacterial pneumonia who first visited the First Hospital of Jilin University from 2021 to 2023 were included. They were divided into SMPP and other respiratory diseases groups to establish a differential diagnostic model for distinguishing between SMPP and other respiratory diseases. Cohort 2 (model 2 construction and internal validation): The SMPP group was divided into myocardial damage, liver damage, and non-damage groups for subgroup analysis to predict the types of extrapulmonary organ damage in SMPP early. Cohort 3 (external validation): Patients with SMPP, viral and bacterial pneumonia who first visited the Meihekou Central Hospital from 2023 to 2024 were included for external validation using the same treatments as in Cohort 1 and Cohort 2. This study was approved by the Ethics Committee of the First Hospital of Jilin University (NO. 2020-313). Due to the retrospective nature of the study, the requirement for written informed consent was waived by the Ethics Committee. The research followed the Helsinki Declaration23.

Criteria for defining SMPP: Patients were diagnosed with SMPP according to the Guidelines for Diagnosis and Treatment of Mycoplasma Pneumoniae Pneumonia in Children (2023 Edition). MPP cases were categorized as SMPP if patients exhibited any of the followings: (1) Poor general condition; (2) Conscious disorder, cyanosis, respiratory dysfunction; (3) Hypoxemia, assisted respiration (groan, nasal fan, three concave sign), intermittent apnea, and oxygen saturation ≤ 92%; (4) Persistent hyperpyrexia for more than 5 days or ultra-hyperpyrexia; (5) Dehydration and food refusal; (6) Chest X-ray or CT scan showing the following findings: unilateral lung infiltration ≥ 2/3, multi-lobar lung infiltration, pleural effusion, pneumothorax, atelectasis, lung necrosis, lung abscess; (7) Extrapulmonary complications.

Inclusion criteria: (1) Patients were younger than 18 years old; (2) Clinical data of the patients were complete. Exclusion criteria: (1) Evidence of co-infection, including bacteria, viruses, fungi, and tuberculosis, etc.; (2) Condition in the recovery period at the time of admission (patients with a disease course of more than 4 weeks, stable temperature for more than 1 week, improvement of chest imaging); (3) Pre-existing other systemic diseases, such as congenital heart disease, chronic kidney disease, chronic lung disease, connective tissue disease, hematological diseases, and tumors; (4) Patients who have received recent anticoagulant treatment or were using anticoagulants for other medical conditions; (5) Patients with a recent history of major surgery, serious trauma, or blood transfusion; and (6) Patients with an immunodeficiency or other diseases that could cause abnormal immune function, as well as those who had recently received immunotherapy or hormone therapy.

Inclusion criteria for other respiratory diseases: (1) Single infection patients with COVID-19, influenza A, and bacterial pneumonia; (2) COVID-19 and influenza A cases confirmed by reverse-transcriptase-polymerase-chain-reaction (RT-PCR); (3) Bacterial pneumonia cases confirmed by detecting at least one putative bacterial pathogen in blood or pleural fluid through culture or polymerase chain reaction (PCR). Exclusion criteria: (1) Malignant tumor; (2) Blood system disease; (3) Serious immune system disease.

Data collection

Clinical data and laboratory test results for the study subjects were collected through the laboratory information system and the electronic medical record system stored in the database. Fasting venous blood was collected within 24 h after admission for blood analysis. A chest X-ray or chest CT was performed 3 days before or within 3 days after admission, and the results were recorded. The patient ID number was used as the unique identification of the study subjects. Specifically, the following clinical data were collected for all patients: (1) Basic demographic information, including age and sex. Clinical characteristics included past medical history, clinical symptoms, length of hospital stay, type of pathogen(s), pre-treatment duration with macrolides, intrapulmonary and extrapulmonary complications, and laboratory and imaging findings. (2) Findings of physical examinations, including respiratory rate, heart rate, and breath sounds. (3) Laboratory tests included white blood cell counts (WBC), hemoglobin (HGB), absolute neutrophil count (ANC), lymphocyte count (LYC), platelet Count (PLT), C-reactive Protein (CRP), procalcitonin (PCT), lactate dehydrogenase (LDH), D-dimer, fibrinogen (FIB), activated partial thromboplastin time (APTT), prothrombin time (PT), prothrombin time activity (PTA), thrombin time (TT), alanine aminotransferase (ALT), aspartate aminotransferase (AST), albumin (ALB), blood urea nitrogen (BUN), blood electrolytes, creatine kinase (CK), MB isoenzyme (CK-MB), hydroxybutyrate dehydrogenase (HBDH), and cardiac troponin I (CTnI), etc. Serum Mycoplasma pneumoniae antibody and MP polymerase chain reaction (PCR) were performed to determine the presence of MP infection. We used virus antigen detection assays and RT-PCR methods to confirm pathogen identification and exclude coinfection. The diagnostic tests were determined based on clinical judgment. These tests were performed on samples taken from various sources, including nasopharyngeal swabs, throat swabs, sputum samples, pleural effusion samples, and bronchoalveolar lavage fluid. The test results of electrocardiogram (ECG), echocardiography, and abdominal ultrasound of the study subjects were collected from the image archiving and communication system.

Data cleaning and normalization

To improve data quality and ensure accuracy, consistency, and availability, we cleaned and standardized the collected raw medical data: (1) Data inspection and cleaning: After summarizing and sorting the raw data, the display formats of values, time, date, and full half-angle were integrated for consistency. Repeatability was checked, and exact duplicate data were eliminated; (2) Data normalization: The four elements of specimen type, test item name, test result unit, and test reference value were calibrated and normalized; (3) Data exclusion and padding: Tests with missing rates > 30% were excluded. For tests with missing rates ≤ 30%, a filling method representing the central tendency of the variables was selected for the measurement data: median replacement, mean replacement, or plural replacement; the count data were randomly interpolated according to the proportion of available negative and positive data.

Dealing with imbalanced datasets

In imbalanced datasets, there were differences in the number of samples between different classes, which could result in poor classification performance of the model for minority classes. To address the issue of sample imbalance, we adopted the SMOTE method to balance the datasets. The SMOTE technique was a type of oversampling method. The SMOTE technique generated randomly new examples or instances of the minority class from the nearest neighbors of line joining the minority class sample to increase the number of instances. As a result, new training datasets were generated. In Cohort 1, SMOTE increased the SMPP group from 1020 instances of the minority class to 1182 patients. In Cohort 2, SMOTE increased the liver damage and non-damage groups from 300 and 354 instances of the minority class to 366 patients.

ML algorithms and model Building

The flow chart for building models using machine learning algorithms was shown in Fig. 1. Cohort 1 and Cohort 2 used the 5-fold cross-validation method on the entire dataset, dividing it into five folds. Four of the folds were used as the training set to train the model, and the remaining fold was used as the validation set to score the model. This process was repeated five times, and the average value was taken. The risk prediction models built by Extreme Gradient Boosting (XGBoost), Logistic Regression (LR), Light Gradient Boosting Machine (LightGBM), K Nearest Neighbors (KNN), and Random Forest (RF) were compared. The ML algorithms with the best performance were selected for subsequent model building and validation. The specific parameter values for each algorithm were as follows, XGBoost: The booster selected was gbtree; LightGBM: The boosting type was set to gradient boosting decision trees, learning rate was set to 0.1, max depth was set to − 1, number of estimator was set to 100, number of leaves was set to 31; RF: The criterion was set to gini, max impurity decrease was set to 0, number of estimators was set to 20; KNN: The number of neighbors was set to 5, weighting scheme was set to uniform; LR: Logistic regression used L2 regularization, with C equals 1, max number of iteration was set to 100. The features selected from the optimal algorithm were used for subsequent model building. To further enhance model performance, a refinement step was implemented by ranking the features based on their importance and selecting the top ten features from the best-performing model. This ensures that the models are built on the most influential features, potentially improving their predictive accuracy. Cohort 3 performs external validation of the model.

Fig. 1
figure 1

Flowchart of machine learning to build the Model 1 and Model 2.

Cohort 1 and Cohort 2 datasets were divided into training and validation sets in a ratio of 7:3. Cohort 1 evaluated model performance using Receiver Operating Characteristic (ROC) curves and calculated the Area Under the Curve (AUC) for the model. Calibration curves were generated to evaluate the proximity of the model’s predicted risk to the actual risk, with Decision Curve Analysis (DCA) applied to evaluate the decision utility of the model. Cohort 2 selected the LightGBM model to construct a classification model. Shapley Additive Explanation (SHAP), an interpretable tool for prediction output of machine learning models, could reflect the influence of the features in each data sample. This method could help increase visibility and interpretability of the ML model. In this study, SHAP analysis was used to explain the decision-making process of the model, including sorting the features by importance, showing the association between observed values and risk. To evaluate the performance of the model, accuracy was calculated. The macro and weighted precision, recall, and F1 scores were calculated for the three classes.

Statistical analysis

Excel 2016 was used to store and manage the data, and SPSS 22.0 was used for statistical analysis. Feature selection and model construction were carried out using the Deepwise & Beckman Coulter DxAI platform (http://dxonline.deepwise.com). In the training cohort, least absolute shrinkage and selection operator (LASSO) logistic regression analysis was utilized to rank the importance of risk factors. In LASSO regression, the beta coefficients of variables that were not strongly associated with the outcome were decreased to zero, which removed these variables from the model. Categorical variables were represented as composition ratios. Continuous variables were represented as mean ± standard deviation (SD) or median with interquartile range (IQR), depending on the data distribution. The Student’s t-test, Mann-Whitney U test, or one-way analysis of variance test was used to compare the distribution of variables between groups, as appropriate. Pearson χ2 test or Fisher exact test was used to compare categorical variables, with a two-sided P < 0.05 considered statistically significant.

Results

Basic characteristics of included cohorts

Cohort 1 enrolled 1020 SMPP patients and 1182 patients with other respiratory diseases, including bacterial pneumonia, COVID-19, and influenza A patients. Cohort 2 enrolled 1020 SMPP patients, including 366 with myocardial damage, 300 with liver damage, and 354 without damage. Cohort 3 enrolled 343 SMPP patients and 313 patients with other respiratory diseases. The demographic, clinical, and laboratory characteristics of the study population were shown in Table 1. In Cohort 1, there was a significant difference in age and sex between the SMPP and other respiratory diseases groups. Patients in the SMPP group were significantly older than those in the other respiratory diseases group (p < 0.001). There were significant differences in laboratory indicators (p < 0.001) between the two groups, except for WBC, CK-MB, and LDH (p > 0.05). In Cohort 2, there were no significant differences in characteristics among the three groups, including sex, clinical features, APTT, and PT (p > 0.05). Compared to those with liver damage and non-damage, SMPP patients with myocardial damage had significantly higher CK-MB values (P < 0.001). Compared to those with myocardial damage and non-damage, SMPP patients with liver damage had significantly higher ALT values (P < 0.001). There were significant differences in age and length of stay among the three groups of SMPP patients (P < 0.001). Specifically, the age and length of stay were higher in SMPP patients with liver damage than in the other two groups. Similarly, the remaining laboratory variables differed significantly (P < 0.001) among the three groups. In Cohort 3, there were significant differences in Age, PTA, APTT, Glucose (GLU), HGB, LYC, and CRP between the SMPP and other respiratory diseases groups (p < 0.05), while the three subgroups of SMPP showed significant differences (p < 0.05) in all laboratory indicators. Spearman correlation analysis method was employed to analyze the correlation between different indicators in the models (Supplementary Fig. S1).

Table 1 Demographic, clinical, and laboratory characteristics.

Establishment of diagnostic model 1

In Cohort 1, LASSO regression was used to screen feature variables, resulting in 41 feature variables included as modeling features. The results of the 5-fold cross-validation of the five machine learning algorithms were shown in Table 2. Among them, the LightGBM algorithm performed the best with the highest AUC (0.975), so the model established by the LightGBM algorithm was chosen as the differential diagnostic model for SMPP and other respiratory diseases (Fig. 2). The top 10 features with the highest feature importance in Model 1 were PTA, PT, HGB, APTT, GLU, HBDH, LYC, AST, CRP, and LDH. The ROC curve results showed that Model 1 had excellent differential diagnostic ability (AUCROC=0.975, sensitivity = 0.739, specificity = 0.993) (Fig. 3A). The calibration curve indicated that Model 1’s sample probabilities were in good agreement with the predicted probabilities (Fig. 3B). Decision curve analysis demonstrated that Model 1 had a high clinical benefit (Fig. 3C). Feature importance analysis by SHAP values for the top 10 features was shown in Fig. 3D.

Table 2 Diagnostic efficacy of five classifiers in the validation set for 5-fold cross-validation.
Fig. 2
figure 2

Receiver operating characteristic curves (ROC) showed the predictions of the five models: XGBoost, LightGBM, Random Forest, Logistic Regression, and KNN.

Fig. 3
figure 3

In the internal validation set, the model 1 built by LightGBM algorithm was evaluated for its ability to differential diagnostic for SMPP and other respiratory diseases. (A) Receiver operating characteristic curve analysis; (B) calibration curve analysis; (C) decision curve analysis; (D) the SHAP values.

Establishment of classification model 2

In Cohort 2, to mitigate multicollinearity, correlation analysis was performed to identify indicators with high correlations, and univariate or multivariate analysis was used to select characteristic variables. Subsequently, LASSO regression was employed for feature variable selection. According to the importance of Lasso-selected feature variables, ALT, CK-MB, Age, AST, WBC, LDH, Length of stay, CO2, ALB, and Ca were identified. Based on the afore-mentioned indicators, among the five machine learning algorithms XGBoost, LR, LightGBM, KNN, and RF, the LightGBM model was chosen due to its superior performance in distinguishing the three subgroups of SMPP patients. The LightGBM model was evaluated to classify the three groups in Cohort 2 (accuracy = 0.814). The results, including weighted precision, recall, weighted F1 scores, and support for our classification model, were comprehensively presented in Table 3. Further elucidating the model’s intricacies, the importance of the average SHAP values of each feature in different classes were presented in Fig. 4. ALT and CK-MB emerged as pivotal contributors significantly influencing the model’s output magnitude, while Ca exhibited the lowest SHAP value, indicating a relatively lesser impact on the model’s classification. Notably, ALT was correlated with liver damage, whereas CK-MB was correlated with myocardial damage, both of which showed relatively significant contributions in the corresponding classes.

Table 3 Performance of the built LightGBM model 2.
Fig. 4
figure 4

Feature importance analysis by Shapley additive explanation (SHAP) values. ND non-damage, MD myocardial damage, LD liver damage.

External verification

Using Cohort 3 as an external verification set, the performance of Model 1 and Model 2 were evaluated. In Cohort 3, 343 SMPP patients and 313 patients with other respiratory diseases were enrolled. The external validation results of the differential diagnostic performance of Model 1 were shown in Supplementary Fig. S2. The ROC curve results showed that the Model 1 had stable and excellent SMPP differential diagnosis ability (AUCROC=0.884 [95%CI, 0.860–0.909], sensitivity = 0.769, specificity = 0.831). The 343 SMPP patients included 115 with myocardial damage, 115 with liver damage, and 113 without damage. The overall diagnostic accuracy of Model 2 was 74.9%. Specifically, the model exhibited the highest diagnostic accuracy in identifying non-damage patients with SMPP, with a precision of 0.818. In terms of differentiating myocardial damage patients and liver damage patients, the model demonstrated precision rates of 0.757 and 0.709, respectively. The confusion matrix for external validation of Model 2 was shown in Supplementary Fig. S3.

Discussion

In recent years, the prevalence of MPP has been on the rise, with high mortality and complication rates24. Although most children with MPP have a good prognosis after macrolide treatment, children with SMPP have a poor prognosis with macrolide treatment and a prolonged disease course. Therefore, early identification of SMPP is beneficial for rational treatment, reducing complications, and optimizing the utilization of medical resources. SMPP presents distinct diagnostic challenges due to its wide range of clinical manifestations and overlapping symptoms with other common respiratory diseases. Timely identification of pneumonia etiology can improve clinical management, including decision-making on antibiotic use.

With the development of AI, ML based on prediction models has been widely used for risk prediction and assisting diagnosis in medicine25,26,27. Increasing studies have proven that ML algorithms have advantages over traditional statistical methods in building prediction models. In the present study, we employ five commonly used machine learning algorithms and systematically evaluate their performance. Through hyperparameter tuning and model selection, we identified the LightGBM algorithm as the best-performing model, achieving superior predictive performance compared to other algorithms. This underscores the significance of algorithm selection and parameter optimization in enhancing model performance. Compared with other diagnostic models, our model demonstrates a comparable high performance (AUCROC=0.975). For example, the AUC of the Nomograph model developed by Chang et al. to predict SMPP in pediatric patients by admission laboratory indicators was 0.77728. Previous studies mainly used traditional calculation tools (nomograms) to establish MPP prediction models. Therefore, the utilization of complex ML algorithms to establish identification and prediction models for SMPP remains a relatively underexplored area of research.

Numerous studies have reported the correlation between age and MPP29. Children over the age of 5 years are more susceptible to MP infection and exhibit more severe MPP symptoms. The median age of patients with viral pneumonia is significantly lower than that of MPP patients. In modeling cohort, the median age of SMPP patients is 6 years old, which is consistent with the previous finding that MP infection rarely occurs in children under 3 years of age30. Lu et al.31. reported that age, LDH, and ESR were significant factors in predicting RMPP using logistic regression. LDH is widely distributed in various tissues of the body, including lung tissue. As a non-specific marker of tissue damage and cell death, serum LDH has long been used for the diagnosis of pulmonary infectious diseases as well as for prognosis prediction32. Consistent with previous reports33,34, the current study has found that LDH is significantly increased in SMPP patients compared to patients with other respiratory diseases. CRP has also been enrolled in our diagnosis model. CRP is recognized variable of inflammation and has been shown in many studies to be significantly elevated in children with MPP and is associated with disease severity. A study has found that children with CRP > 15.49 mg/L have a higher risk of developing SMPP.

Weights are numerical parameters representing the importance of different features or inputs in a model. In the realm of clinical diseases, researching weights in relation to disease relevancy has become a hot topic. By conducting weight analysis, we find the weight of coagulation indicators, including PTA, PT, and APTT, are significantly prominent in the diagnostic model for SMPP (Model 1). This discovery provides useful references for our in-depth research and further clinical applications. Recent studies have shown that coagulation abnormalities in children with MPP are not uncommon. The specific mechanism of abnormal coagulation function in MP infection is unclear but may be related to MP inducing massive synthesis and secretion of a series of cytokines, such as interleukins, tumor necrosis factors, and chemokines, leading to local vascular damage and accumulation of metabolites in that area, resulting in vascular blockage35. It is also found that abnormal coagulation may be involved in the development of SMPP and may be closely related to the development and prognosis of its complications36,37. This study compared the coagulation function between the SMPP group and the other respiratory diseases group and found that the differences in PTA, PT, and APTT levels were statistically significant. Previous studies have found higher D-dimer levels in children with MPP than in healthy children and higher D-dimer levels in patients with SMPP, particularly those with SMPP with extra-pulmonary complications38. However, our study shows that detecting the levels of coagulation function indices, especially PTA, PT, and APTT, can help the early identification of SMPP and other respiratory diseases patients. Given the limited content of this study, multiple controlled studies and multicentre studies can be conducted in the future to further confirm the findings of this study. It is also possible to analyse the changes in coagulation in SMPP in conjunction with other tests, such as thromboelastography; the underlying mechanisms can be explored to gain insight into the mechanisms of coagulation abnormalities and SMPP.

According to reports, the frequency of extrapulmonary symptoms associated with MP infection has increased in recent decades39. The occurrence of cardiac and liver complications related to MPP has been well confirmed40,41. Cardiac events and liver involvement are the two most common extrapulmonary manifestations, and multiple factors are involved in the pathogenesis of these conditions. A study found that TIM1 is associated with CK-MB, whereas TIM3 and TLR2 are associated with ALT, indicating that cardiac and liver damage caused by MP infection results from a combination of inflammatory cytokines and autoimmune reactions. In the risk predictive model of organ damage for SMPP, age emerged as an important predictor. Li et al. reported that MP infections cause more serious myocardial damage in children aged 13–36 months and 72 months-14 years as detected using serum CK-MB concentrations42. In addition, our study found that the median age of SMPP children with myocardial damage was significantly lower than that of SMPP children with liver damage. These findings suggest that paediatrician should pay more attention to age-specific differences in the extrapulmonary complications associated with MP infections.

Myocardial enzymes are the main serum enzymes used for the clinical diagnosis of MPP complicated with myocardial damage, including CK, LDH, CK-MB, and AST43. The fact that AST and LDH can be detected in a variety of tissues results in a lack of specificity. As a myocardium-specific enzymatic indicator, CK-MB is scarcely found in other tissues. The change in CK-MB activity is closely associated with the necrosis of myocardial cells44. In Cohort 2 of this study, we found that the levels of CK-MB in SMPP patients with myocardial damage are significantly higher than those in patients with liver damage or non-damage. Additionally, among pediatric patients with extrapulmonary damage, except for myocardial damage, the CK-MB levels in those with liver damage are also higher than in those non-damage. A study shows asymptomatic elevation of liver enzymes in MPP patients45. ALT levels increase significantly after infection, indicating liver involvement during the disease process. In Cohort 2, the ALT levels in patients with liver damage are significantly higher than those in patients with myocardial damage and non-damage. In this study, the feature importance analysis reveals that ALT and CK-MB are among the top contributing variables to the Model 2. Additionally, parameters such as AST, WBC, and LDH rank among the top ten in feature importance scores and serve as crucial predictors of organ damage for SMPP in children. In contrast to the prevalent binary classification methods employed by existing ML models, our Model 2 represents a pioneering effort in the realm of multiclass classification. A single model can provide the risk probabilities of SMPP associated with three distinct organ injury types, which enhances clinical operability and applicability.

This study employs readily available and simple indicators for modelling, aiming to maximize the clinical applicability. However, there are several limitations: (1) Our research cohorts are relatively homogenous, although the performance of models are promising, they may vary across different healthcare settings and patient populations. Therefore, future studies should aim to validate these models across diverse populations to explain potential confounding factors such as ethnicity. Moreover, we plan to use datasets from diverse sources and geographical locations for external validation in the future, which is crucial for evaluating the generalizability of the model and ensuring its applicability in varied clinical scenarios. (2) Due to the limited technology and resources available, this study is unable to demonstrate the ensemble learning effect of multiple machine learning models. In the future, our studies may explore using more advanced algorithms, such as deep learning or ensemble learning algorithms, to further improve prediction accuracy.

In conclusion, the machine learning algorithm based on laboratory parameters can provide clearer decision-making guidance for SMPP differential diagnosis and effectively predict the occurrence of organ damage in SMPP children, providing strong references for clinical diagnosis and treatment. It is worth further clinical research and promotion.