Introduction

Both osteosarcoma (OS) and chondrosarcoma (CS) are common primary malignant bone tumors1,2,3,4,5,6,7, but their treatment methods vary greatly due to the varying sensitivity of radiotherapy/chemotherapy8,9. Therefore, early differential diagnosis is of great significance. At present, bone tissue biopsy is still the gold standard for diagnosis, but biopsy has invasiveness, sampling errors, and potential risks of tumor spread10. Therefore, it is necessary to find a non-invasive and highly accurate method for early differentiation between OS and CS. Magnetic resonance imaging (MRI) is the preferred detection method for diagnosing, staging, and monitoring bone and soft tissue lesions, especially fat suppressed T2-weighted imaging (T2WI-FS) and contrast-enhanced T1-weighted imaging (CET1), which have high sensitivity in describing lesion morphology and surrounding structures involved11. However, there are still great difficulties in the differential diagnosis between OS and CS by the above sequences alone12. The concept of radiomics was first reported in 2012, which converts images into mineable data with high fidelity and throughput for extraction and application in clinical decision support systems13,14,15. It has been widely applied in tumor staging, differential diagnosis, and efficacy prediction16,17,18. However, little study of MRI-based radiomics to differentiate OS from CS has been reported19,20. In this study, we aim to construct a radiomics model based on the CET1 and T2WI-FS sequences of MRI, and explore its value in distinguishing between OS and CS.

Methods

Study design and consent

This is a retrospective cohort study (Level III evidence) in which subjects received a image examination. This study was approved by the Ethics Committee of the Second Xiangya Hospital of Central South University (No. K0340) and was conducted in accordance with the Declaration of Helsinki. All data used in the present study were obtained from the electronic medical records of our hospital and treated with strict confidentiality. The requirement of obtaining specific informed consent for this retrospective study was waived. All patients provided written informed consent for receiving this image examination. Due to the retrospective nature of the study, Jiangdong Ni (member of Institutional Review Board) waived the need of obtaining informed consent.

Patients

The general information of patients who met the following inclusion criteria and exclusion criteria in the Second Xiangya Hospital of Central South University from January 2014 to October 2020 was retrospectively analyzed, including patient gender, age, tumor localization and histopathological results. Inclusion criteria: (1) preoperative MRI examination, including at least CET1 and T2WI-FS sequences; (2) pathologically confirmed OS or CS cases by needle biopsy; (3) preoperative MRI and biopsy interval not exceed 3 months. Exclusion criteria: (1) patients with other tumors than OS or CS; (2) patients received surgery, radiotherapy, chemotherapy or molecular targeted therapy before MRI; (3) occult lesions which were poorly showed on the images, and difficult to delineate the region of interest (ROI) and extract the radiomics characteristics; (4) images or related sequences were lost.

Data collection of MRI

Preoperative examination was performed using a SIEMENS 3.0T MR scanner (Magnetom Skyra, Siemens Healthcare, Erlangen, Germany), at least including CET1 and T2-FS sequences. In this study, CET1 and T2WI-FS sequences were selected for radiomics feature extraction and analysis. Dimeglumine gadopentetate injection (Magnevist®, Bayer HealthCare AG, Daxing District, Beijing) was intravenously injected with weight-based dosing regimen (0.1 mmol/L/kg body weight) and injection rate of 2.5 mL/s. All MR data were retrieved from our hospital’s Picture Archiving and Communication System (PACS) and saved in DICOM format for further analysis.

Preprocessing of MRI images

All images were transferred to ITK-SNAP software (version 3.8.0, http://www.itksnap.org/) for delineation of tumor area. We draw the region of interest for each layer of MRI and finally generate three-dimensional segmentation of the region of interest. All lesions were manually segmented by a radiologist with more than 5 years of experience in MRI diagnosis. The accuracy of the segmentation was confirmed by an other radiologist with more than 10 years of experience in MRI diagnosis. Two radiologists modified the specific segmentation of image when there was disagreement after discussion. ROI was delineated on one image of each sequence, including the entire tumor, peritumoral edema, and periosteal reaction. The selected slice or image showed the maximum diameter of the lesion.

Extraction of radiomics features

Pyradiomics (https://pyradiomics.readthedocs.io/en/latest/), a commonly used Python open-source software package for radiomics studies, is applied to extract radiomics features from medical images (CT, PET, MRI). Five categories of radiomics features defined by Pyradiomics include first-order gray level features, shape features, gray level co-occurrence matrix (GLCM), gray level run length matrix (GLRLM), and wavelet transform features21.

Selection of features

The least absolute shrinkage and selection operator (LASSO) method was used for the screening of the optimal features in the training set. The regularization parameter (λ) in the feature selection was adjusted using the minimum criterion, with 10 cross-validations used. The selected features with non-zero coefficients were used to construct the radiomics model. The radiology score (Rad-Score) of each patient was calculated with the weighting coefficient of logistic regression model.

Statistical analysis

R software (R Foundation for Statistical Computing, Vienna, Austria. URL: http://www.R-project.org), and SPSS 22.0 software were applied for statistical analysis, in which LASSO Logistic regression analysis and cross-validation were performed with the “glmnet” software package, with P < 0.05 as statistically significant.

Results

Extraction/selection of radiomics feature

Figure 1 showed the study workflow and the basic procedure was shown in Fig. 2. 116 patients were finally enrolled in this study, including 52 females and 64 males, aged 5–82 (28.9 ± 19.8) years. Those cases included 79 OS cases and 37 CS cases, and were randomly assigned to the training group (n = 87) and the validation group (n = 29) in a ratio of 3:1. The training group was used for establishing the diagnostic model, and the validation group for verifying the diagnostic efficacy of model in differentiating OS from CS. The clinical characteristics of the patients in this study were shown in Table 1.

Fig. 1
figure 1

The flowchart of eligible patients.

Fig. 2
figure 2

Basic procedures for the study. (1) Screen patients suitable for this study from all OS and CS patients in our hospital and obtain CET1 and T2WI-FS sequence data; (2) experienced radiologist outlines the tumor on CET1 and T2WI-FSMR images; (3) extract 530 initial radiomics features from the tumor area delineated by the original MR image; (4) select radiomics features by least absolute shrinkage and selection operator (LASSO) logistic regression; (5) use ROC analysis to evaluate the established radiomics model.

Table 1 General information of patients and characteristics of tumors.

Totals of 530 initial features were extracted from the MR raw data and were divided into first-order features (n = 18), shape features (n = 8), GLCM features (n = 24), GLRLM features (n = 16), and wavelet transform features (n = 464). All sequence parameters were shown in Table 2. In this study, a total of 530 radiomics features were extracted in CET1WI images, including 18 first order features, 8 shape features, 16 GLRLM features, 24 GLCM features, and 464 wavelet transform features. Some features are reported in Table 3. Intra-class correlation coefficients (ICCs) ranged from 0.66 to 0.99 (average = 0.90) for all extracted features. The twenty-three selected radiomics features showed good inter-reader agreement, with ICCs ranging from 0.90 to 0.99.

Table 2 MRI sequence parameters.
Table 3 Radiomics features used in the analysis.

LASSO regression is the most widely used shrinkage method, especially in reducing overfitting and enhancing prediction accuracy. Previous studies have directly used LASSO for feature shrinkage and dimensionality reduction to screen out features able to used in the construction of logistic regression models22,23,24,25,26,27,28,29. Hence, the LASSO method was performed to analyze the extracted features in this study. We selected the 15 features which differentiated OS from CS significantly from the T2WI-FS images and the 8 most significant features from the CET1 images using the LASSO method (Fig. 3). The selected features were shown in Table 4. The distribution of CET1 and T2-FS sequence-based radiomics features was shown by violin diagram (Figs. 4, 5).

Fig. 3
figure 3

Radiomics features were selected from CET1 and T2WI-FS using the LASSO binary logistic regression model. (a,c) is the tuning parameter (λ) in the LASSO model selected by ten cross-validations based on the minimum criterion. The lower x axis represents log (λ), the upper x axis represents the number of elements, and the y axis represents the binomial deviation. The dashed vertical line indicates the deviation value for each model with a given λ. The vertical dashed black line indicates the optimal value of λ. When the optimal λ is selected as 0.0008 in (a), log (λ) value is −3.13, and when the optimal λ is selected as 0.0031 in (c), log value (λ) is −2.506352. (b,d) Shows the LASSO coefficient profiles of 530 radiomics features. (b) Shows 15 selected features with non-zero coefficients in the T2WI-FS sequence and (d) shows 8 selected features with non-zero coefficients in the CET1 sequence.

Table 4 Radiomics feature selection by the LASSO method.
Fig. 4
figure 4

Violin plot of 8 screened radiomics features based on CET1 sequence: OS represents osteosarcoma and CS represents chondrosarcoma. The violin plot analysis compared the difference and distribution of 8 radiomics features extracted from CET1 sequences of OS and CS by LASSO method.

Fig. 5
figure 5

Violin plot of 15 screened radiomics features based on T2-FS sequence: OS represents osteosarcoma and CS represents chondrosarcoma. The violin plot analysis compared the difference and distribution of 15 radiomics features extracted from T2-FS sequences of OS and CS by LASSO method.

Differentiation performance

Table 5 present the sensitivity and specificity of the training group and validation group in distinguishing OS from CS using T2WI-FS and CET1 data. In the training group, the specificity of the radiomics model constructed based on CET1 was 0.897 and the sensitivity was 0.81, and the specificity of the radiomics model constructed based on T2-FS was 0.862 and the sensitivity was 0.966. Radiomics models based on CET1 and T2WI-FS images in the training group obtained AUC values of 0.894 (95% CI 0.817–0.970) and 0.970 (95% CI 0.940–0.999), respectively, by ROC curve analysis. In the validation group, the specificity of the radiomics model constructed based on CET1 was 0.75 and the sensitivity was 0.81, and the specificity of the radiomics model constructed based on T2-FS was 0.875 and the sensitivity was 0.857. Radiomics models based on CET1 and T2WI-FS images in the validation group obtained AUC values of 0.821 (95% CI 0.642-1.000) and 0.899 (95% CI 0.785-1.000), respectively (Fig. 6).

Table 5 Differentiated performance based on T2WI-FS and CET1 in the training and validation group.
Fig. 6
figure 6

ROC curves for the training and validation groups of the radiomics model. ROC curves for the T2WI-FS and CET1 based radiomics models for OS and CS in the training (a) and validation (b) groups. The AUC of CET1 and T2WI-FS in the training group was 0.894 (95% CI 0.817–0.970) and 0.970 (95% CI 0.940–0.999), respectively; and the AUC in the validation group was 0.821 (95% CI 0.642–1.000) and 0.899 (95% CI 0.785–1.000), respectively.

Discussion

In this study, we developed a radiomics model based on T2WI-FS and CET1 sequences to differentiate between OS and CS. This radiomics model performed satisfactorily, with AUC-based accuracies of 0.97 and 0.89 in the training and validation cohorts, respectively. Meantime, the specificity (75% and 87.5%) and sensitivity (81% and 85.7%) of CET1 and T2-FS were high in the verification group, suggesting that the diagnostic model has good diagnostic efficacy.

OS and CS are often difficult be accurately differentiated in clinical practice based on clinical features and imaging data, and ultimately need to be confirmed by needle biopsy. In terms of the age of predilection, although OS mainly occurs in children and young people, and CS mainly occurs in the elderly, the highest incidence rate is between 50 and 60 years old, but it occasionally occurs in younger individuals, so it is impossible to absolutely distinguish OS from CS based on age30. In addition, although pathological biopsy has a sensitivity and specificity of 86% and 95% in diagnosing bone tumors, it is still difficult to distinguish cartilage lesions. Pathological biopsy highly relies on the biopsy site and fixed tumor components, making it difficult to distinguish some subtypes of OS and CS25,26,27. For example, the common site and imaging characteristics of dedifferentiated chondrosarcoma (DDCS) are similar to those of OS, involving the proximal femur and presenting as a lytic, destructive, and invasive lesion with elastic regions and cortical de struction, both of which are easily confused with each other. Chondroblastic osteosarcoma (COS) is prone to producing cartilage like matrix, with the most common being high-grade transparent cartilage, making it extremely difficult to distinguish from CS31. MRI is a routine diagnostic tool for various tumors, as it has good soft tissue resolution and can more clearly display bone marrow replacement and cortical destruction21. It provides good tissue contrast, significantly better than CT and X-ray examinations. However, considering age distribution, high incidence sites, and similar pathological tissue types, MRI is also difficult to accurately distinguish between OS and CS. The emergence of radiomics can precisely be used to solve the above-mentioned problems.

This study is the first attempt to use MRI radiomics to differentiate the diagnosis of OS and CS. Previous studies have explored the effectiveness of CT and Micro CT radiomics in distinguishing between OS and CS, but the results are not very satisfactory as they cannot distinguish the differentiation of tumor stroma in detail and determine the boundaries32,33. Pathological biopsy is prone to loss of information due to tissue processing, while MRI can preserve the initial information of the tumor to the greatest extent possible. Therefore, MRI radiomics is more suitable for clinical differential diagnosis and tumor classification through raw feature extraction34.

Previous studies have reported that CET1 and T2WI-FS are widely used in the differentiation of malignant tumors. Moreover, T2WI-FS can effectively display the edge of the lesion, while CET1 can display the degree of blood supply to the lesion. Both have unique advantages in distinguishing between necrosis and solid tumors35,36,37,38,39,40,41,42. Therefore, this study selected the above sequence to verify that both have high accuracy in distinguishing between OS and CS, and further found that T2WI-FS has better diagnostic value than CET1, with higher sensitivity and specificity.

Our study has several limitations. First, our study has a small sample size, which may affect the diagnostic performance of radiomics models. However, we believe that our findings have significant clinical implications and require multi-sample center research to improve diagnostic efficiency. Second, our study used LASSO regression for dimensionality reduction and selected the final features, without using other statistical methods for initial screening. However, there is still controversy in previous studies regarding whether to conduct initial screening, and LASSO regression is more suitable for analyzing a large number of radiomics features with a small sample size, so our results are reliable. Third, future studies should focus on other factors that may affect diagnostic performance of radiomics models for bone malignant tumors.

Conclusions

In this study, we developed a radiomics model based on T2WI-FS and CET1 sequences to differentiate between OS and CS. This model exhibits good performance and can help clinicians make decisions and optimize the use of healthcare resources.