Synthetic lumbar MRI can aid in diagnosis and treatment strategies based on self-pix networks

Song, Ke; Zhu, Wendong; Zhang, Zhenxi; Liu, Bin; Zhang, Meiling; Tang, Tinglong; Liang, Jie; Wu, Weifei

doi:10.1038/s41598-024-71288-4

Download PDF

Article
Open access
Published: 02 September 2024

Synthetic lumbar MRI can aid in diagnosis and treatment strategies based on self-pix networks

Ke Song^1,2^na1,
Wendong Zhu³^na1,
Zhenxi Zhang⁴,
Bin Liu⁵,
Meiling Zhang^1,2,
Tinglong Tang³,
Jie Liang^1,2 &
…
Weifei Wu^1,2

Scientific Reports volume 14, Article number: 20382 (2024) Cite this article

1234 Accesses
1 Citations
Metrics details

Subjects

Abstract

CT and MR tools are commonly used to diagnose lumbar fractures (LF). However, numerous limitations have been found in practice. The aims of this study were to innovate and develop a spinal disease-specific neural network and to evaluate whether synthetic MRI of the LF affected clinical diagnosis and treatment strategies. A total of 675 LF patients who met the inclusion and exclusion criteria were included in the study. For each participant, two mid-sagittal CT and T2-weighted MR images were selected; 1350 pairs of LF images were also included. A new Self-pix based on Pix2pix and Self-Attention was constructed. A total of 1350 pairs of CT and MR images, which were randomly divided into a training group (1147 pairs) and a test group (203 pairs), were fed into Pix2pix and Self-pix. The quantitative evaluation included PSNR and SSIM (PSNR1 and SSIM1: real MR images and Pix2pix-generated MR images; PSNR2 and SSIM2: real MR images and Self-pix-generated MR images). The qualitative evaluation, including accurate diagnosis of acute fractures and accurate selection of treatment strategies based on Self-pix-generated MRI, was performed by three spine surgeons. In the LF group, PSNR1 and PSNR2 were 10.884 and 11.021 (p < 0.001), and SSIM1 and SSIM2 were 0.766 and 0.771 (p < 0.001), respectively. In the ROI group, PSNR1 and PSNR2 were 12.350 and 12.670 (p = 0.004), and SSIM1 and SSIM2 were 0.816 and 0.832 (p = 0.005), respectively. According to the qualitative evaluation, Self-pix-generated MRI showed no significant difference from real MRI in identifying acute fractures (p = 0.689), with a good sensitivity of 84.36% and specificity of 96.65%. No difference in treatment strategy was found between the Self-pix-generated MRI group and the real MRI group (p = 0.135). In this study, a disease-specific GAN named Self-pix was developed, which demonstrated better image generation performance compared to traditional GAN. The spine surgeon could accurately diagnose LF and select treatment strategies based on Self-pix-generated T2 MR images.

Translation of computed tomography images to T2-Weighted magnetic resonance images of lumbar spine using generative adversarial networks on sagittal images

Article Open access 26 May 2025

ISRSL0: compressed sensing MRI with image smoothness regularized-smoothed $\:{\ell}_{0}$ norm

Article Open access 16 October 2024

Lumbar spine segmentation in MR images: a dataset and a public benchmark

Article Open access 02 March 2024

Introduction

Lumbar fracture (LF) is the most common type of spinal fracture and is more likely to occur in elderly individuals¹. LF is classified into sub-acute fracture (SF) and acute fracture (AF).The characteristics of SF include chronic lower back pain, vertebral height loss and even lumbar kyphosis deformity, which are often caused by microfractures due to poor bone quality in elderly individuals. The characteristics of AF include severe lower back pain, lower limb paralysis and even urinary and faecal incontinence, which are often caused by severe trauma^2,3. Computerized tomography (CT) and magnetic resonance imaging (MRI) are standard methods for diagnosing LF. Although CT scans are fast and suitable for high spatial resolution, they have a limited capacity to distinguish between the characteristics of SF and AF⁴, which is very important for selecting treatment strategies. MR excels in diagnosing AF, but it is costly, time-consuming and inconvenient in actual clinical situations⁵. For example, MR examination cannot be performed for emergencies that occur at night, which may delay the diagnosis and treatment of trauma patients and adversely affect neurological recovery⁶. In addition, some patients are not suitable for MR examinations, such as those with metal implants or who experience claustrophobia. Furthermore, MR machines are not available in some developing countries and remote regions. Therefore, exploring new techniques to convert CT images to MR images holds significant practical relevance.

Artificial intelligence, particularly deep learning (DL), has made significant progress in medical imaging in recent years. For normal medical images, some studies have shown that DL can help to accomplish complicated tasks, such as recognition and segmentation of features in two-dimensional (2D) or three-dimensional (3D) images^7,8, image transformation and simulation^9,10, and reconstruction of three-dimensional images^8,11. Most of these findings are limited to technological innovations. If these techniques are used for clinically specific diseases, it is essential to accurately process medical images depicting abnormalities, which places greater demands on DL and researchers due to the complexity of the disease ___location and degree. Previous studies have reported automatic disease classification based on ___location and features¹², but these investigations were confined to the images themselves and were not associated with clinical diagnosis or treatment. The most recent studies focused on single-modal disease image synthesis and its clinical diagnostic value^13,14, indicating that mutual transformation of different models may be feasible, such as CT images synthesized from MR images. The generative adversarial network (GAN), including an image generation network (G) and an image discrimination network (D)¹⁵, is a promising approach in the field of image transformation. Many complicated tasks can be achieved through the constant confrontation between G and D, such as image enhancement and denoising^16,17, image alignment^18,19, image synthesis¹⁴ and image transformation^20,21. In studies of multimodal medical image transformation, a GAN was used to synthesize CT data from head and neck MR images to guide tumour radiotherapy dosing²⁰, which is critical for accurately eradicating tumours and minimizing patient harm. Weijuan et al. reported that a GAN can be used to diagnose lumbar AF via digital radiography (DR)²¹, which is helpful for improving the efficiency of LF diagnosis.

Compared to DR and CT, MRI is the gold standard for diagnosing AF due to its detailed damage characteristics and lack of organ interference⁵. To the best of our knowledge, no studies on CT-to-MR conversion have specifically focused on LF.

The primary objectives of this study were twofold:

To innovate and develop a spine disease-specific GAN and verify the quality of synthetic images.
To evaluate whether synthetic MRI of LF affects the clinical diagnosis and treatment strategies using large samples.

Methods and materials

Patients and images

From January 2017 to July 2023, 2902 patients with LF were treated in our hospital. Patients whose CT and MR images were obtained before surgery and within one week of each other were included. The exclusion criteria included significant differences in lumbar spine curvature between CT and MRI, metal artefacts, pathological fractures (spinal tuberculosis and spinal tumours), non vertebral fractures (vertebral arch, transverse process, spinous process), and other diseases (spondylolisthesis, disc herniation). Ultimately, 675 LF participants met the inclusion and exclusion criteria (Fig. 1). Moreover, 55 participants with non lumbar fracture (NLF) were randomly included to examine the potential impacts of LF on the quantitative evaluation. Patient details, including age, sex, fracture ___location, and symptoms, are shown in Table 1.

Table 1 Demographic and clinical characteristics of the participants.

Full size table

For each participant included, two mid-sagittal CT and T2-weighted MR images were selected. Ultimately, the study included 1350 pairs of LF CT and MR images, along with 110 pairs of NFL CT and MR images (Fig. 2). This study was approved by the review board of China Three Gorges University. All the study methods were conducted in accordance with the China Three Gorges University guidelines and regulations, and all the experimental protocols were approved by the China Three Gorges University committee. The requirement for informed consent was waived by the China Three Gorges University committee because retrospective data were used.

CT and MR scans

The vast majority of participants were scanned using a 64-row multidetector CT scanner (LightSpeed VCT; GE Medical System; GE: General Electric Company) and dual-source CT scanners (SOMATOM Definition Flash; SIEMENS Healthineers). Few patients received a 16-row multidetector CT scanner (SOMATOM SCOPE, Siemens Healthineers) in the emergency department. All CT images were reconstructed using GE ADW 4.6 (Advantage workstation). All MR scans were acquired using 3.0 T MR scanners (GE MR750) (Tables 2, 3).

Table 2 Scan parameters for the CT protocol.

Full size table

Table 3 Scan parameters for the sagittal T2-weighted MR imaging protocol.

Full size table

Paired data processing

The images were cropped manually to focus precisely on the lumbar spine: the top was the T12 upper edge, the bottom was the S1 lower edge, the front was the L4 or L5 anterior edge, and the rear was the final edge of the lumbar spinous process. The brightness and contrast of the images were adjusted to accentuate the features of the vertebrae, and the images were further subjected to a noise reduction algorithm (Fig. 2). By splicing cropped CT and MR images, 1350 matched LF images and 110 matched NLF images were obtained. These images were randomly divided into 1,147 training, 203 test, 89 training and 21 test datasets.

Model architecture

A new model called Self-pix was constructed that introduces Self-Attention²². The Residual Network²³ (Resnet) was used as the generator of Self-pix, and the Markovian discriminator²⁴ (PatchGAN) was used as the discriminator (Fig. 3).

Generator

The Pix2pix²⁵generator used the Unet²⁶ structure, but Self-pix utilised the Resnet²³ structure (Fig. 3). Resnet has advantages over Unet in dealing with complex images and is more suitable for our subject. Self-pix contains 9 ResNet_blocks, which are built by adding a shortcut every two layers of the network (shortcut connections); these can deepen the network model: the deeper the network, the better is the representation ability. However, an increase in depth can cause problems, such as elevation dissipation and gradient explosion. The emergence of residual blocks effectively solves the above problems.

Discriminator

Self-pix’s discriminator is PatchGAN²⁴ (Fig. 3). The difference between the PatchGAN and the general GAN is that the PatchGAN outputs an N*N matrix (N = 70 in Self-pix). All the elements in this matrix have only two values, true or false, which increase the receptive field of the model to the original image.

Objective function

Self-pix is a variant of the conditional GAN²⁷ (cGAN), the objective function of which can be expressed as follows:

$$ {\mathcal{L}}_{{{\text{cGAN}}}} \left( {{\text{G}},{\text{ D}}} \right) = {\mathbb{E}}_{{{\text{x}},{\text{y}}}} \left[ {\log {\text{D}}\left( {{\text{x}},{\text{y}}} \right)} \right] + {\mathbb{E}}_{{{\text{x}},{\text{z}}}} \left[ {\log \left( {1 - {\text{D}}\left( {{\text{x}},{\text{ G}}\left( {{\text{x}},{\text{z}}} \right)} \right)} \right)} \right] $$

(1)

G wants to reduce this objective function, and D wants to play against G. Self-pix chooses the objective function of the L1 distance because L1 can promote less fuzzy generation; its objective function is expressed as follows:

$$ {\mathcal{L}}_{{{\text{L}}1}} \left( {\text{G}} \right) = {\mathbb{E}}_{{{\text{x}},{\text{y}},{\text{z}}}} \left[ {{\text{y}} - {\text{G}}\left( {{\text{x}},{\text{z}}} \right)_{1} } \right] $$

(2)

Self-pix’s objective function is described as follows:

$$ G^{*} = arg\mathop {\min }\limits_{G} \mathop {\max }\limits_{D} {\mathcal{L}}_{cGAN} \left( {G,D} \right) + \lambda {\mathcal{L}}_{L1} \left( G \right) $$

(3)

where $\lambda$ is a weight parameter that is used to balance the two objective functions L1 and cGAN.

Self-attention

Self-Attention²² has been widely used in NLP since its inception. Parmar et al.²⁸ proposed a model that adds Self-Attention to the image field to generate images. Han Zhang et al.²⁹ proposed a model in which Self-Attention is added to GAN to capture the query position of an image through Self-Attention. Self-Attention computes three matrices, Query, Key, and Value, by the elements of the input sequence X. The corresponding attention score is obtained by dot multiplication of Q and K. Then, the Softmax activation function is used to change its value to 0–1 to obtain the corresponding attention weight. We further obtained the output by multiplying the dot by V and summing it.

Self-Attention was added to the generator's first downsampling layer, the first upsampling layer and Resblocks to find the best-optimised model. This process is conducive for the model to capture the features in the input image and improve the model's performance and imaging effect. To further improve the model's performance and reduce the computational overhead, we added spectral normalisation³⁰ in the convolutional layer of D.

Quantitative image analysis

The mean square error (MSE), peak signal-to-noise ratio (PSNR), and structural similarity (SSIM)³¹ were calculated to evaluate the differences between the original MR image and the synthetic MR image. Pix2pix-generated and Self-pix-generated MR images were compared with the original MR images using the SSIM and PSNR.

The calculation formula for MSE is as follows: M and N are the number of rows and columns in an image, X_ij is the pixel intensity in the real MR image, and Y_ij is the pixel intensity in the synthesized MR image.

$$ {\text{MSE}} = \frac{1}{{{\text{M}} \times {\text{N}}}}\mathop \sum \limits_{{{\text{i}} = 1}}^{{\text{m}}} \mathop \sum \limits_{{{\text{j}} = 1}}^{{\text{n}}} \left( {{\text{X}}_{{{\text{ij}}}} - {\text{Y}}_{{{\text{ij}}}} } \right)^{2} $$

(4)

The PSNR is an error-sensitive image quality evaluation tool based on errors between corresponding pixel points. It does not consider the visual characteristics of human eyes. The calculation formula for the PSNR is as follows: MAX_i is the maximum in an image. To ensure the accuracy of the calculation, we converted the image type from 8 to 64. In this case, MAX_i = 1.

$$ {\text{PSNR}} = 20{\text{log}}_{10} \left( {\frac{{{\text{MAX}}_{{\text{i}}} }}{{\sqrt {{\text{MSE}}} }}} \right) $$

(5)

The SSIM is applied to measure the structural similarity of two images, and its values range from 0 to 1. When two images are infinitely identical, the SSIM is close to 1. The following formula was used to calculate the SSIM between two images at a given pixel: where µ_x and µ_y are the mean values of the two images within a small window around a fixed pixel point;ð_x and ð_y are the standard deviations of the two images computed within the same window; ð_xy is the covariance between the two images computed within the same window; and C₁ and C₂ e regularisation constants to prevent denominators of zero.

(6)

Qualitative image analysis

Three practised spine surgeons with more than ten years of professional experience who were blinded to the acquisition protocols were invited to independently conduct the qualitative evaluation independently, which consisted of 203 original MRIs and 203 Self-pix-generated MRIs. First, the surgeons identified whether patients had acute fractures and recorded the fractured vertebrae if confirmed. In addition, all surgeons discussed and then chose an appropriate treatment strategy based on the patient's basic information and imaging findings: conservative treatment (ConT), percutaneous vertebroplasty (PVP)/percutaneous kyphoplasty (PKP), minimally invasive surgery (MIS), and open decompression and fixation (ODF). Finally, the overall similarity was evaluated using the following four-point scale: 1 = undiagnosable, 2 = acceptable for diagnosis, 3 = good and 4 = excellent.

Statistical analysis

Normally distributed continuous variables are presented as the mean ± standard deviation (SD). Otherwise, variables are presented as medians with the corresponding 25^th and 75^th percentiles. Categorical variables are expressed as absolute numbers with percentages. Both Pix2pix and Self-pix utilised the same datasets, so the statistical method of paired validation was adopted to check the difference between the two GANs. The paired-sample ttest was used for quantitative analysis if the data were normally distributed. Otherwise, the Wilcoxon signed-rank test was used. In the qualitative analysis, McNemar's test was used to assess the accuracy of Self-pix in diagnosing AF, and the McNemar-Bowker test was used to assess treatment strategy consistency. All statistical analyses were performed using IBM SPSS Statistics software (version 25.0, IBM Corp). A two-tailed P value less than 0.05 indicated a statistically significant difference.

Results

Quantitative assessment

The PSNR1 and SSIM1 were compared between real MR images and Pix2pix-generated MR images. The PSNR2 and SSIM2 were compared between real MR images and Self-pix-generated MR images. The region of interest (ROI) was defined as the fractured vertebra (Fig. 4).

The PSNR and SSIM of the different groups are displayed in Fig. 5. In the LF group, the PSNR1 and PSNR2 were 10.884 and 11.022 (Wilcoxon signed-rank test, Z = − 6.603, p < 0.001), and the SSIM1 and SSIM2 were 0.767 and 0.771 (paired-sample t test, t = − 5.546, p < 0.001), respectively. In the ROI group, the PSNR1 and PSNR2 were 12.351 and 12.670 (paired-sample t test, t = − 2.937, p = 0.004), and the SSIM1 and SSIM2 were 0.817 and 0.832 (paired-sample t test, t = − 2.822, p = 0.005), respectively. In the NLF group, the PSNR1 and PSNR2 were 17.254 and 17.709 (Wilcoxon signed-rank test, Z = − 2.485, p = 0.013), and the SSIM1 and SSIM2 were 0.933 and 0.939 (Wilcoxon signed-rank test, Z = − 2.659, p = 0.008), respectively. The Self-pix-generated MRI parameters were greater than the Pix2pix-generated MRI parameters in all groups. The SSIM and PSNR of the ORI group with Self-pix increased by 7.91% and 14.91%, respectively, compared with those of the LF group, but Pix2pix improved by only 6.5% and 13.48%, respectively (Fig. 5C).

Qualitative assessment

A total of 3045 (203*5*3) pairs of vertebrae were used to assess the consistency of the diagnosis of acute lumbar fractures. Among the pairs of vertebrae, 518 were diagnosed with AF and 2527 were normal (including SF) using real-MRI (R-MRI), and 512 were diagnosed with AF and 2533 were normal (including SF) using synthetic-MRI (S-MRI). No statistical difference was found between R-MRI and S-MRI (Mcnemar’s test, p = 0.689). The sensitivity was 84.36% (95% CI 80.98–87.24%), and the specificity was 96.65% (95% CI 96.29–97.63%).The consistency between Self-pix-generated MRI and real MRI for diagnosing acute lumbar fracture was excellent (Cohen’s kappa test, κ = 0.818, p < 0.001) (Table 4).

Table 4 Diagnostic consistency for vertebrae fracture between R-MRI and S-MRI: McNemar’s test.

Full size table

After removing patients with false positive and false negative findings based on synthetic MRI, 197 LFPs were included to evaluate treatment strategy consistency, and only four patients received different treatments (Fig. 6). The four patients underwent ConT with synthetic MRI; however, two patients were treated with PKP/PVP, and the others were treated with MIF. These findings revealed no significant difference (Mcnemar-Bowker test, χ² = 4, p = 0.135) and excellent consistency (Cohen’s kappa test, κ = 0.932, p < 0.001) between Self-pix-generated MRI and real MRI in terms of treatment strategy (Table 5). The overall similarity scores were as follows: none of the images were undiagnosable, 44 (7.22%) were acceptable, 232 (38.1%) were good, and 333 (54.68%) were excellent; the average score was 3.47 (Fig. 5D).

Table 5 Treatment strategy consistency between R-MRI and S-MRI: McNemar–Bowcker test.

Full size table

Discussion

The results of this study showed that synthetic T2 MRI achieved the same effect as real MRI in identifying AF, indicating its significant importance for precision medicine and promoting patient recovery. Although X-ray and CT examinations can better display longitudinal cracks in the vertebral body, the display is not ideal for horizontal vertebral fractures and mild compression fractures⁴. These fractures may only manifest as abnormalities in vertebral morphology on X-rays and CT scans, making their diagnosis highly dependent on the surgeon's experience. Although dual-energy computed tomography can provide information about bone marrow oedema, it has not been widely adopted in clinical practice due to its high cost and steep learning curve. In comparison, MR is not only widely used, but it also has significant advantages in terms of resolution, contrast, imaging sequence, soft tissue lesions, and bone marrow oedema⁵. In addition, MRI can reveal not only soft tissue conditions, such as blood and oedema, but also fracture lines, which can comprehensively determine the duration of damaged lesions⁴. T1-weighted sequences provide excellent contrast for bone, muscle and fat, which is beneficial for identifying normal and abnormal anatomical structures. T2-weighted sequences are highly sensitive to fluid and oedema, making them more effective for observing features such as bone marrow extravasation, blood and oedema around fractured vertebrae, which are crucial for diagnosing AF. Moreover, severe AF fragments may compress the spinal cord, and T2-weighted sequences provide superior visualization of the spinal cord, aiding in the selection of surgical strategies and improving patient prognosis. After vertebral AF, the T1-weighted and T2-weighted signals are irregular, with sharp edges and clearly visible fracture lines, and the high-signal fracture line on T2 is as clear as that on T1. SF of the vertebral body has undergone basic repair and reconstruction due to injury, with changes only in appearance. Therefore, the signals presented at T1 and T2 are similar to those of normal vertebral bodies³². Based on the above characteristics, T2-weighted sequences are often preferred by spinal surgeons for evaluating lumbar fractures. Although fat suppression T2 sequences indeed enhance the identification of soft tissue lesions by suppressing fat signals, they are unsuitable for learning and training in deep learning due to noticeable and uncontrollable noise.

In synthetic MRI using a Self-pix network, the AF of the vertebrae displayed a high signal intensity with high sensitivity and specificity, which is extremely beneficial for clinical applications. For trauma patients with severe spinal fractures in the emergency department, recovery is closely related to rapid diagnosis and surgical decompression. An optimal postoperative spinal cord recovery requires completion of the operation decompression within 8 h⁶, which not only tests the surgeon’s skill but also requires the diagnostic efficiency of the radiology department. A fast spine MR examination takes at least 10 to 15 min, during which patients are required to remain motionless in a confined and enclosed space. To some extent, this immobility prolongs the patient's physical pain and may also cause psychological stress. Moreover, MRI scans are not available at night in most hospitals, resulting in extended time for the referral process and delaying optimal decompression and recovery. These conditions can cause irreversible damage to the spinal cord and have a serious impact on future quality of life. Moreover, MRI is not available for some patients, such as those with metal implants or claustrophobia. MR machines are not available in certain poor developing countries or remote areas. A lack of MR support can lead to insufficient understanding of fracture conditions, such as vertebral body information around obvious fractures, among surgeons and patients. An inappropriate treatment strategy might result in potentially exacerbated injuries. Based on the various limitations mentioned above, in the present study, high-quality synthesis MR images of lumbar fractures obtained from CT images will help surgeons and patients make accurate diagnoses as early as possible and then formulate treatment strategies.

In recent years, networks such as GAN have made significant advancements in the field of clinical medical imaging, including within specialised areas such as spinal surgery. Jiawei et al. developed a fully automated procedure to segment and quantitatively measure lumbar vertebrae and intervertebral discs³³. Ki-Taek et al.³⁴ used a GAN to synthesize lumbar MR images from CT images. Gao et al. achieved 3D reconstruction of the spine using 2D orthogonal X-ray images¹¹. While these studies have achieved major breakthroughs in technology, their application remains limited to normal vertebrae and has not yet been implemented in clinical practice. In addition, Li et al. explored the feasibility of automatically detecting vertebral fractures using DL³⁵, and Weijuan et al. employed a GAN to distinguish lumbar AF from DR²¹. Although these studies were relevant to the disease, the results were not in line with clinical practice. Spine surgeons are more concerned about the diagnosis and treatment of AF rather than LF detection, which is closely related to the prognosis of patients. MRI is the gold standard for diagnosing AF due to the abovementioned inconveniences. In an emergency setting, X-ray and CT serve as standard methods for diagnosing vertebral fractures, but there may be risks of missed diagnoses and delayed treatment, which can worsen the prognosis. To solve this problem, this study proposed a new disease-specific GAN for CT images-to-MR images.

In the present study, a new model named Self-pix is proposed, which is based on Pix2pix and introduces Self-Attention into the generator and Resnet blocks. Generating diseased MR images is more difficult than generating normal MR images due to the random area and degree of the disease. In addition, the learning and output of traditional GANs are too generalised to focus on diseased areas. Therefore, Self-Attention was introduced to emphasise areas with larger changes, which can be reflected in higher similarity values between the Query and the Key, leading togreater attention on these areas and the subsequent output of more realistic diseased MR images. This model was applied to LF for the first time. Comparing ROI with LF, Self-pix showed an increase of 14.91% in PSNR and 7.91% in SSIM, significantly surpassing the 13.48% and 6.50% of Pix2pix, which confirmed the consistency and enhanced image quality of the Self-Attention mechanism. This improvement further demonstrated that the Self-Attention mechanism mainly focused on the fractured vertebrae, aligning closely with our study's clinical goal. However, the PSNR averaged between 10 and 12 in the LF ROI, thus markedly deviating from visual perception. To address this issue, NFL MR images were specially trained and tested, where the PSNR reached approximately 18. This result aligned more closely with human visual perception. This discrepancy might be due to differences in brightness between CT and MR images. GAN learns CT and MR features simultaneously and transforms these features between the two when they are output. Although we tried to unify the brightness of CT and MR images, some systematic errors persisted. Therefore, the pixel value in synthetic MR images was affected by CT, and the high complexity of fractured vertebrae magnified this difference. As the PSNR was defined to calculate the difference in pixel values, synthetic MRI of LF had lower PSNR values than synthetic MRI of NLF. Based on the above findings and our experimental results, we believe that the PSNR may be unsuitable for assessing the similarity between synthetic LF MR images and their original MR images.

Our synthetic MR image shows promise for clinical application. AF occurs within a short period and is often characterised by severe pain and signs of acute injury. The fracture is unstable at this stage, and the vertebra may further collapse or even compress the spinal cord after movement⁴. As a result, more aggressive treatments are often used to relieve spinal cord compression or prevent further vertebrae collapse³⁶. SF can occur within several months or more prior to diagnosis. At this point, the vertebra has stabilised, but sequelae such as chronic pain and malunionmay continue⁴. Patients are prioritised for conservative therapies such as bed rest, pain management, brace fixation, and physical therapy³⁷. There are significant differences in disease progression and treatment strategies between patients with AF and SF, so it is important to distinguish between them. In this study, surgeons accurately identified SF and AF based on our synthetic MR images, with a sensitivity of 84.36% and a specificity of 96.65%. In AF, different surgical procedures are chosen depending on the degree of vertebral destruction. PKP/PVP is an interventional procedure in which bone cement is injected into the fractured vertebra through small incisions in the skin to stabilise the vertebra and reduce pain³⁸. Surgical injury is mild, and postoperative recovery is rapid. This procedure is suitable for compression fractures and currently stable fractures³⁹. MIF involves a relatively large area and incision and requires the insertion of internal fixation screws⁴⁰. The damage degree is greater, and the recovery is slower. This procedure suitable for treating fractures with severe vertebral collapse³⁸. The ODF is the most damaged, has the slowest recovery, and is suitable for burst fractures or patients with spinal cord compression^39,41,42,43. Surgeons choose the most beneficial operation for patients according to the degree of vertebral destruction and the patient’s physical situation. Among the 197 patients accurately diagnosed using synthetic MRI in the present study, 193 received the same treatment strategies as those used in the real-world study. Based on the results of this study, our synthetic MRI does not affect the diagnosis or treatment strategy of LF patients.

This was a large-sample study, starting from a clinical question and ultimately contributing to clinical practice. Our study has several limitations. First, CT scans were obtained from three different machines with various imaging styles. Despite our best efforts to standardise the data, unavoidable systematic error may have been introduced. Second, only the MR T2-weighted sequence was included, but in certain cases, it is challenging to distinguish between acute and sub-acute fractures without the support of T1 or fat-suppressed sequences, T1 and T2 with fat suppression will be investigated in future research.

Conclusion

In this study, we developed a disease-specific GAN with better image generation performance than the traditional GAN. The spine surgeon can correctly diagnose LF and select a treatment strategy based on Self-pix-generated T2 MR images. The PSNR may be unsuitable for assessing the similarity between synthetic MRI and original MRI in LF.

Data availability

All data generated or analyzed during this study were deposited at correspondence author’s ___location.

Abbreviations

LF:: Lumbar fracture
SF:: Sub-acute fracture
AF:: Acute fracture
CT:: Computerized tomography
MRI:: Magnetic resonance image
DL:: Deep learning
DR:: Digital radiography
GAN:: Generative adversarial network
NLF:: Non lumbar fracture
LFP:: Lumbar fracture participant
NLFP:: Non lumbar fracture participant
Mul:: Multiple fracture
NF:: No fracture
ADW:: Advantage workstation
MSE:: Mean square error
PSNR:: Peak signal-to-no ratio
SSIM:: Structural similarity
NAF:: No acute fracture
ConT:: Conservative treatment
PKP/PVP:: Percutaneous vertebroplasty/percutaneous kyphoplasty
MIF:: Minimally invasive fixation
ODF:: Open decompression and fixation
Resnet:: Residual network
PatchGAN:: Markovian discriminator
cGAN:: Conditional GAN
ROI:: Region of interest
R-MRI:: Real-MRI
S-MRI:: Synthetic-MRI
BMI:: Body mass index

References

Alsoof, D. et al. Diagnosis and management of vertebral compression fracture. Am. J. Med. 135(7), 815–821 (2022).
Article PubMed Google Scholar
Jin, C., Xu, G., Weng, D., Xie, M. & Qian, Y. Impact of magnetic resonance imaging on treatment-related decision making for osteoporotic vertebral compression fracture: A prospective randomized trial. Med. Sci. Monit. 24, 50–57 (2018).
Article PubMed PubMed Central Google Scholar
Cunningham, C. & Mc Laughlin, H. Physiotherapy post vertebral fragility fracture: A scoping review. Physiotherapy 119, 100–116 (2023).
Article CAS PubMed Google Scholar
Wood, K. B., Li, W., Lebl, D. R. & Ploumis, A. Management of thoracolumbar spine fractures. Spine J. 14(1), 145–164 (2014).
Article PubMed Google Scholar
Lenski, M., Büser, N. & Scherer, M. Concomitant and previous osteoporotic vertebral fractures. Acta Orthop. 88(2), 192–197 (2017).
Article PubMed PubMed Central Google Scholar
Kato, S. et al. Does surgical intervention or timing of surgery have an effect on neurological recovery in the setting of a thoracolumbar burst fracture?. J. Orthop. Trauma 31(Suppl 4), S38-s43 (2017).
Article PubMed Google Scholar
Hallinan, J. et al. Deep learning model for automated detection and classification of central canal, lateral recess, and neural foraminal stenosis at lumbar spine MRI. Radiology 300(1), 130–138 (2021).
Article PubMed Google Scholar
Wu, W., Pan, J., Wang, Y., Wang, S. & Zhang, J. Multi-channel optimization generative model for stable ultra-sparse-view CT reconstruction. IEEE Trans. Med. Imag. https://doi.org/10.1109/TMI.2024.3376414 (2024).
Article Google Scholar
Zhao, M., Liu, X., Liu, H. & Wong, K. K. L. Super-resolution of cardiac magnetic resonance images using laplacian pyramid based on generative adversarial networks. Comput. Med. Imaging Graph. 80, 101698 (2020).
Article PubMed Google Scholar
Xiao, Y. et al. A novel hybrid generative adversarial network for CT and MRI super-resolution reconstruction. Phys. Med. Biol. 68(13), 135007 (2023).
Article Google Scholar
Gao, Y. et al. 3DSRNet: 3D spine reconstruction network using 2D orthogonal X-ray images based on deep learning. IEEE Trans. Instrum. Meas. https://doi.org/10.1109/TIM.2023.3296838 (2023).
Article PubMed Google Scholar
Qiu, S. et al. Multimodal deep learning for Alzheimer’s disease dementia assessment. Nat. Commun. 13(1), 3404 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Jiang, B. et al. Deep learning reconstruction shows better lung nodule detection for ultra-low-dose chest CT. Radiology 303(1), 202–212 (2022).
Article PubMed Google Scholar
Lyu, J. et al. Generative adversarial network-based noncontrast CT angiography for aorta and carotid arteries. Radiology 309(2), e230681 (2023).
Article PubMed Google Scholar
Goodfellow, I., Pouget-Abadie, J., Mirza, M et al. Generative adversarial nets. 2014; 27.
Halupka, K. J. et al. Retinal optical coherence tomography image enhancement via deep learning. Biomed. Opt. Express. 9(12), 6205–6221 (2018).
Article PubMed PubMed Central Google Scholar
Ma, Y. et al. Structure and illumination constrained GAN for medical image enhancement. IEEE Trans. Med. Imaging 40(12), 3955–3967 (2021).
Article PubMed Google Scholar
Zhou, H., Ma, J., Tan, C. C., Zhang, Y. & Ling, H. Cross-weather image alignment via latent generative model with intensity consistency. IEEE Trans. Image Process. 29, 5216 (2020).
Article ADS Google Scholar
Huang, S., Sun, L., Yousefnezhad, M., Wang, M. & Zhang, D. Functional alignment-auxiliary generative adversarial network-based visual stimuli reconstruction via multi-subject fMRI. IEEE Trans. Neural Syst. Rehabil. Eng. 31, 2715–2725 (2023).
Article PubMed Google Scholar
Touati, R., Le, W. T. & Kadoury, S. A feature invariant generative adversarial network for head and neck MRI/CT image synthesis. Phys. Med. Biol. 66(9), 095001 (2021).
Article Google Scholar
Chen, W. et al. A deep-learning model for identifying acute vertebral compression fractures on digital radiography. Eur. Radiol. 32(3), 1496–1505 (2022).
Article PubMed Google Scholar
Vaswani, A., Shazeer, N., Parmar, N., et al. Attention is all you need. In Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach, California, USA: Curran Associates Inc.; 2017:6000–6010.
He, K., Zhang, X., Ren, S., Sun, J. Deep residual learning for image recognition. In Paper presented at: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); 27–30 June 2016, 2016:770–778.
Zhu, J.Y., Park, T., Isola, P., Efros, A.A. Unpaired image-to-image translation using cycle-consistent adversarial networks. In Paper presented at: 2017 IEEE International Conference on Computer Vision (ICCV); 22–29 Oct. 2017, 2017:2242–2251.
Isola, P., Zhu, J.Y., Zhou, T., Efros, A.A. Image-to-image translation with conditional adversarial networks. In Paper presented at: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); 21–26 July 2017, 2017:5967–5976.
Ronneberger, O. & Fischer, P. Brox T Convolutional Networks for Biomedical Image Segmentation 234–241 (Springer, 2015).
Google Scholar
Mirza, M., Osindero, S. Conditional Generative Adversarial Nets.
Parmar, N., Vaswani, A., Uszkoreit, J., et al. Image Transformer. In Paper presented at the International Conference on Machine Learning, Stockholm, Sweden, July 10–15, 2018. Proceedings of Machine Learning Research; 80:4055–4064.
Zhang, H., Goodfellow, I., Metaxas, D. & Odena, A. Self-attention generative adversarial networks. Int. Conf. Mach. Learn. 97, 7354–7363 (2019).
Google Scholar
Miyato, T., Kataoka, T., Koyama, M., & Yoshida, Y. Spectral normalization for generative adversarial networks. In Proceedings of the 6th International Conference on Learning Representations (ICLR), Vancouver, Canada, April 30–May 3, 2018.
Chow, L. S. Paramesran RJBsp, control. Rev. Med. Image Qual. Assess. 27, 145–154 (2016).
Google Scholar
Duy, P. Q., Ikuta, I., Johnson, M. H., Davis, M. & Zohrabian, V. M. MRI in Spine Trauma. In MRI of the Spine: A guide for orthopedic surgeons (eds Morrison, W. B. et al.) 31–86 (Springer, 2020).
Chapter Google Scholar
Huang, J. et al. Spine explorer: A deep learning based fully automated program for efficient and reliable quantifications of the vertebrae and discs on sagittal lumbar spine MR images. Spine J. 20(4), 590–599 (2020).
Article PubMed Google Scholar
Hong, K. T. et al. Lumbar spine computed tomography to magnetic resonance imaging synthesis using generative adversarial network: Visual turing test. Diagnostics 12(2), 530 (2022).
Article CAS PubMed PubMed Central Google Scholar
Li, Y. C. et al. Can a deep-learning model for the automated detection of vertebral fractures approach the performance level of human subspecialists?. Clin. Orthop. Relat. Res. 479(7), 1598–1612 (2021).
Article PubMed PubMed Central Google Scholar
Muratore, M., Ferrera, A., Masse, A. & Bistolfi, A. Can we predict the progression of vertebral collapse in conservative treatment of osteoporotic vertebral fractures? A 3-year retrospective study of 180 patients from the emergency department. Int. J. Spine Surg. 14(4), 641–648 (2020).
Article PubMed PubMed Central Google Scholar
Spiegl, U. J. et al. The conservative treatment of traumatic thoracolumbar vertebral fractures. Dtsch. Arztebl. Int. 115(42), 697–704 (2018).
PubMed Google Scholar
Zhu, D. et al. A comparison between modified unilateral extrapedicular and bilateral transpedicular percutaneous kyphoplasty in the treatment of lumbar osteoporotic vertebral compression fracture. World Neurosurg. 166, e99–e108 (2022).
Article PubMed Google Scholar
Drazin, D. et al. Outcomes and national trends for the surgical treatment of lumbar spine trauma. Biomed. Res. Int. 2016, 3623875 (2016).
Article PubMed PubMed Central Google Scholar
Caruso, G. et al. Minimally invasive fixation techniques for thoracolumbar fractures: Comparison between percutaneous pedicle screw with intermediate screw (PPSIS) and percutaneous pedicle screw with kyphoplasty (PPSK). Eur. J. Orthop. Surg. Traumatol. 28(5), 849–858 (2018).
Article PubMed Google Scholar
Zeng, Z., Zhang, D., Zeng, F. L. & Ao, J. Posterior unilateral small fenestration of lamina combined with a custom-made Y-shaped fracture reduction device for the treatment of severe thoracolumbar burst fracture: A prospective comparative study. J. Orthop. Surg. Res. 18(1), 529 (2023).
Article PubMed PubMed Central Google Scholar
Pan, J. et al. Iterative residual optimization network for limited-angle tomographic reconstruction. IEEE Trans. Image Process. 33, 910–925 (2024).
Article ADS PubMed Google Scholar
Wu, W., Wang, Y., Liu, Q., Wang, G. & Zhang, J. Wavelet-improved score-based generative model for medical imaging. IEEE Trans. Med. Imag. 43(3), 966–979 (2024).
Article Google Scholar

Download references

Acknowledgements

Thank Yong Zong and Song Chen for qualitative assessment of all images in the study.

Funding

The study was supported by National Natural Science Foundation of Hubei province (2023AFB1006) and Hubei Provincial Health Commission Young Talent Programme (WJ2023Q020).

Author information

These authors contributed equally: Ke Song and Wendong Zhu.

Authors and Affiliations

The First College of Clinical Medical Science, China Three Gorges University, Yichang, 443000, China
Ke Song, Meiling Zhang, Jie Liang & Weifei Wu
Yichang Central People’s Hospital, Yichang, 443000, China
Ke Song, Meiling Zhang, Jie Liang & Weifei Wu
College of Computer and Information Technology, China Three Gorges University, Yichang, 430002, China
Wendong Zhu & Tinglong Tang
School of Biomedical Engineering, Sun Yat-Sen University, Shenzhen, 518107, China
Zhenxi Zhang
Wendeng Orthopaedic and Traumatologic Hospital of Shandong Province, Weihai, 264400, China
Bin Liu

Authors

Ke Song
View author publications
Search author on:PubMed Google Scholar
Wendong Zhu
View author publications
Search author on:PubMed Google Scholar
Zhenxi Zhang
View author publications
Search author on:PubMed Google Scholar
Bin Liu
View author publications
Search author on:PubMed Google Scholar
Meiling Zhang
View author publications
Search author on:PubMed Google Scholar
Tinglong Tang
View author publications
Search author on:PubMed Google Scholar
Jie Liang
View author publications
Search author on:PubMed Google Scholar
Weifei Wu
View author publications
Search author on:PubMed Google Scholar

Contributions

The authors report no conflict of interest concerning the materials or methods used in this study or findings specified in this paper. Author contributions to the study and manuscript preparation include the following. Conception and design: WF W. clinical assessment: WF W, K S, B L, ML Z, J L. Neural network development: WD Z, ZX Z,TL T. Data measurement and analysis: K S,WD Z,ZX Z. Drafting the article: K S,WD Z,ZX Z,B L,ML Z ,TL T ,J L,WF W. Critically revising the article: WF W. Reviewed final version of the manuscript and approved it for submission: all authors. Study supervision: WF W. The authors report no conflict of interest concerning the materials or methods used in this study or findings specified in this paper. The authors have reviewed the final version of the manuscript and approve it for publication.

Corresponding author

Correspondence to Weifei Wu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethics approval

The present study was approved by the review boards of the Three Gorges University.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Song, K., Zhu, W., Zhang, Z. et al. Synthetic lumbar MRI can aid in diagnosis and treatment strategies based on self-pix networks. Sci Rep 14, 20382 (2024). https://doi.org/10.1038/s41598-024-71288-4

Download citation

Received: 28 January 2024
Accepted: 27 August 2024
Published: 02 September 2024
DOI: https://doi.org/10.1038/s41598-024-71288-4

Subjects

Abstract

Similar content being viewed by others

Translation of computed tomography images to T2-Weighted magnetic resonance images of lumbar spine using generative adversarial networks on sagittal images

ISRSL0: compressed sensing MRI with image smoothness regularized-smoothed \(\:{\ell}_{0}\) norm

Lumbar spine segmentation in MR images: a dataset and a public benchmark

Introduction

Methods and materials

Patients and images

CT and MR scans

Paired data processing

Model architecture

Generator

Discriminator

Objective function

Self-attention

Quantitative image analysis

Qualitative image analysis

Statistical analysis

Results

Quantitative assessment

Qualitative assessment

Discussion

Conclusion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethics approval

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links