Neural network application for assessing thyroid-associated orbitopathy activity using orbital computed tomography

Lee, Jaesung; Lee, Sanghyuck; Lee, Won Jun; Moon, Nam Ju; Lee, Jeong Kyu

doi:10.1038/s41598-023-40331-1

Download PDF

Article
Open access
Published: 10 August 2023

Neural network application for assessing thyroid-associated orbitopathy activity using orbital computed tomography

Jaesung Lee^1,2,
Sanghyuck Lee¹,
Won Jun Lee³,
Nam Ju Moon³ &
…
Jeong Kyu Lee ORCID: orcid.org/0000-0001-9797-9945³

Scientific Reports volume 13, Article number: 13018 (2023) Cite this article

1587 Accesses
9 Citations
Metrics details

Subjects

Abstract

This study aimed to propose a neural network (NN)-based method to evaluate thyroid-associated orbitopathy (TAO) patient activity using orbital computed tomography (CT). Orbital CT scans were obtained from 144 active and 288 inactive TAO patients. These CT scans were preprocessed by selecting eleven slices from axial, coronal, and sagittal planes and segmenting the region of interest. We devised an NN employing information extracted from 13 pipelines to assess these slices and clinical patient age and sex data for TAO activity evaluation. The proposed NN’s performance in evaluating active and inactive TAO patients achieved a 0.871 area under the receiver operating curve (AUROC), 0.786 sensitivity, and 0.779 specificity values. In contrast, the comparison models CSPDenseNet and ConvNeXt were significantly inferior to the proposed model, with 0.819 (p = 0.029) and 0.774 (p = 0.04) AUROC values, respectively. Ablation studies based on the Sequential Forward Selection algorithm identified vital information for optimal performance and evidenced that NNs performed best with three to five active pipelines. This study establishes a promising TAO activity diagnosing tool with further validation.

Neural network-based method for diagnosis and severity assessment of Graves’ orbitopathy using orbital computed tomography

Article Open access 15 July 2022

Neural network-based automated proptosis measurement using computed tomography images for patients with thyroid-associated orbitopathy

Article Open access 08 November 2024

Performance analysis and knowledge-based quality assurance of critical organ auto-segmentation for pediatric craniospinal irradiation

Article Open access 21 February 2024

Introduction

Thyroid-associated orbitopathy (TAO) is an autoimmune disorder associated with Graves’ disease (GD)^1,2. TAO patients endure various symptoms, from dry eye and tearing to severe functional abnormalities such as visual disturbances and diplopia. These symptoms commence with eyelid and orbit inflammation that exacerbates as inflammation progresses³. Therefore, inflammation onset and severity evaluations are vital for designing treatment policies and predicting TAO prognosis. The clinical activity score (CAS) is the most widely used inflammatory degree assessment for TAO patients but partly relies on subjective responses^4,5. In addition, CAS cannot sufficiently distinguish active inflammation from congestive changes in orbital soft tissue, a common severe TAO symptom. Therefore, a more objective and quantified method for evaluating orbital inflammation is imperative.

Radiological examinations have been extensively studied as potential alternatives for evaluating orbital inflammation, with computed tomography (CT) the most widely used in TAO diagnosis. Several studies certify that CT facilitates inflammation assessment in TAO patients^6,7. However, since CT image interpretation is highly dependent on clinician experience, interpreted data may be incomplete or engender inconsistencies. Neural network (NN)-based method efficacies in eyelid and orbital disease have recently garnered attention^8,9,10, and several studies have integrated NNs into radiographic image analysis to overcome these shortcomings^11,12,13. Orbital image analysis using an NN can be applied for TAO diagnosis, and TAO activity evaluation using MRI has been recently reported¹⁴. However, an NN-based method for TAO activity evaluation using CT images, which are more widely used and cost-effective than MRI, has yet to be considered.

This study devises a new NN that evaluates TAO activity by analyzing CT images. Although TAO is more common in women, older men have worse clinical features¹⁵. However, it is unclear whether additional clinical information, such as age and sex, can improve NN performance. Therefore, we also strove to confirm whether NN performance could be improved by supplementing orbital CT images with clinical data.

Results

This study included 144 active (52 men, 92 women) and 288 (41 men, 247 women) inactive TAO patients; the male subject proportion was higher in the active group than in the inactive group. The mean active TAO patient age was 46.1 ± 13.6 years, older than inactive TAO patients (p < 0.001). Table 1 organizes these characteristics.

Table 1 Subject characteristics.

Full size table

Table 2 summarizes the proposed and two comparative NN performance evaluations. The proposed NN’s performance in evaluating active and inactive TAO patients achieved a 0.871 area under the receiver operating curve (AUROC), 0.786 sensitivity, and 0.779 specificity values. In contrast, the comparison models CSPDenseNet and ConvNeXt were significantly inferior the proposed model, with 0.819 (p = 0.029) and 0.774 (p = 0.004) AUROC, 0.774 (p = 0.028) and 0.694 (p = 0.005) sensitivity, and 0.731 (p = 0.110) and 0.692 (p = 0.008) specificity values, respectively. Notably, the proposed NN significantly outperformed ConvNeXt in four additional metrics. In contrast, there was no statistically significant accuracy (p = 0.071), F1 score (p = 0.052), or precision (p = 0.069) difference between our proposed model and CSPDenseNet.

Table 2 Performance evaluation of the neural network models’ TAO activity assessment.

Full size table

We identified CT slices vital for NN performance through an ablation study that activated or deactivated input pipelines. Table 3 conveys the top five pipeline combination performances identified from our ablation study and sorts them by AUROC value. The best pipeline combination was SA_R, CO2, CO1, and AX1 with the highest value in six metrics (AUROC, 0.871; accuracy, 0.803; F1 score, 0.732; sensitivity, 0.805; specificity, 0.802; precision, 0.671). The second-best combination was CO2 and SA_R with a 0.871 AUROC.

Table 3 Top five AUROC-sorted pipeline combinations among the ten identified from the pipeline enable/disable procedure.

Full size table

The AUROC value shift relative to the enabled pipeline numbers was also examined to show how pipeline quantity affects NN performance. Experimental results demonstrated that NN performance was maximized on average with three to five enabled pipelines (Fig. 1). At six or more, performance gradually dwindled as enabled pipeline numbers increased. For further discussion, we visualized the critical locations in the input image when the model was based on the best pipeline combination (Fig. 2) The heat maps establish that the NN diagnoses the patient as active by extracting information from the medial rectus muscle in CO2 and the central part of the superior and inferior rectus muscles in SA_R.

Discussion

This study introduced a NN that can evaluate TAO patient activity using orbital CT image with 0.871 AUROC, 0.782 accuracy, 0.786 sensitivity, and 0.779 specificity score. As a 0.8 AUROC or higher is considered good¹⁶, the proposed NN effectively assist in ascertaining activity and treatment plans for TAO patient. However, this performance could be improved for practical applications. Our results were slightly lower than the 0.922 AUC from a previous study that evaluated TAO patient activity by applying a NN to an orbital MRI¹⁴, potentially due to NN performance variability or MRI’s sharper resolution. CT is deemed inferior to MRI regarding image detail, as it is unclear whether the image purely reflects acute inflammation in orbital tissues. Nevertheless, the rarity of studies that have evaluated TAO patient activity with orbital CT images highlights the need for improved performance and this study’s significance.

The proposed NN outperformed ConvNeXt, a comparative model, in all performance comparisons and exhibited significantly higher AUROC and sensitivity scores than CSPDenseNet. As the proposed NN is configured to allow multiple slices to be passed through multiple heads simultaneously, it can extract and fuse vital information to improve performance. In addition, most existing NNs were primarily designed to receive a single image (RGB or grayscale). However, learning three-dimensional (3D) CT sequencing’s detailed anatomical information is challenging for these architectures. Furthermore, the enable or disable pipeline procedure in the ablation study may contribute to the improvement by automatically modifying the model to receive only essential information. Through these two approaches, we constructed a framework capable of accurately assisting TAO activity assessments with improved performance over comparative models.

The algorithm in our ablation study also assists in determining significant slice and tissue changes in CT image interpretation. Our results indicated that CO2 (the coronal slice located 1/2 distance between the largest eyeball and orbital exit slices) and SA_R (the sagittal slice for the right eye) were selected most frequently during the early procedure stages. Excessive CT slices diminish NN performance due to unnecessary information. Therefore, identifying the minimum CT slice amount with the greatest efficacy is necessary for improving NN performance. Previous studies that applied NNs to CT also selected and analyzed a few CT slices rather than analyzing all CT images^17,18. 3D CT reconstruction may help preserve as much necessary information as possible.

Interestingly, we determined that incorporating age and sex did not notably improve NN performance. Age and sex are standard patient information, and epidemiological studies have established that TAO is more common in women, whereas older men are more likely to suffer from worse clinical features^15,19. Therefore, the AI models also analyzed this data as it could potentially enhance their understanding and analysis of TAO patient activity. Nevertheless, age and gender were not conducive to determining TAO activity in this study. Although the exact reason remains unclear, CT images may already incorporate data more efficacious for determining TAO activity than age or sex. As such, TAO patient activity can likely be judged in practical settings without this information if clinical characteristics such as proptosis, diplopia, eyelid, and anterior segment finding are sufficiently provided. Alternatively, age and sex may already be considered in CT images^20,21. Thus, age and gender data may not influence NN performance. Further research is needed to conclude if supplemental clinical data related to TAO activity can improve NN performance, such as thyroid function tests, thyroid-stimulating hormones receptor antibody levels, smoking status, or clinical photos.

We compared the proposed NN with two conventional NNs to evaluate its performance. CSPDenseNet has recently performed well as a convolutional NN-based model in image classification and is the proposed model’s baseline. In addition, DenseNet-based models are widely used in the medical field regardless of disease or imaging modality^22,23,24. ConvNeXt, based on ResNet²⁵, is the most modern NN architecture developed capable of reaching Transformer-based classification models’ overwhelming performance. ResNet performance was improved by applying novel CNN technologies, such as changing the stage-compute ratio. In addition, ResNet is the most renowned model in computer vision and medical image analysis^26,27,28.

Although designing a NN by selecting all CT slices is theoretically better, surplus information will likely amplify noise and diminish performance. Therefore, only eleven CT slices were selected and analyzed per subject. Researcher favor AX1, the slice with the largest lens in the axial plane, for exophthalmos evaluation using CT scans^29,30. In addition, CO2 and CO3 coronal slices are 1/2 to 2/3 of the distance from the largest eyeball slice (CO1) to the orbit exit and are frequently recommended for analyzing extraocular muscle thickness or determining compressive optic neuropathy^31,32. AX2 to AX5, CO2, and CO3 slices are automatically selected from AX1 and CO1, respectively. Furthermore, we selected AX6 for lacrimal gland representation and a sagittal slice expected to show eyelid changes and superior and inferior rectus muscles shifts³³. These eleven slices were selected as they best reflected CT changes.

Our study has several limitations. First, section bias may exist as the datasets were collected from a single center. In addition, no standardized orbital CT dataset has yet to be established for diagnosing TAO. Furthermore, enlarged datasets are required for additional training and validation. Since convolutional NN-based deep learning requires significant data, larger dataset could improve diagnostic performance. In addition, the CAS assessment can only estimate inflammation and is sometimes insufficient to determine an accurate active status. Therefore, CAS-based NN diagnostic accuracy may be restricted by the limitation of CAS itself. At this stage, NN applications should be limited to complementing CAS rather than replacing it.

In conclusion, we substantiated NNs’ applicability in assessing TAO activity using orbital CTs. The proposed NN can reliably distinguish between active and inactive TAO patients. To our knowledge, there has yet to be a study on NN-based activity evaluation in TAO patients using orbital CT. The utilized code is publicly available at https://github.com/tkdgur658/MTANet. Although we have paid significant attention to TAO activity evaluation, it is essential to improve activity discrimination efficiency further. Our developed NN will assist in accurately diagnosing and evaluating TAO patients and become the basis for smart diagnosis.

Methods

The Institutional Review Board of Chung-Ang University Hospital approved this study (IRB No, 2029-028-19439), and the informed consent requirement was waived due to its retrospective design. This study was conducted in accordance with the ethical standards outlined in the Declaration of Helsinki.

Participants

We obtained orbital CT scans (Philips Brilliance 256 Slice CT, Philips Healthcare Systems, Andover, MA, USA) without contrast from TAO patients diagnosed between January 2010 to October 2019. Our study included 432 TAO patients, among whom 144 were diagnosed as active and 288 inactive. TAO patients were diagnosed according to Bartley and Gorman’s criteria³⁴. A seven-point modified CAS formula assessed inflammatory activity by assigning a point to each item: retrobulbar pain, eye movement pain, eyelid redness, conjunctival injection, caruncle or plica inflammation, eyelid swelling, and chemosis. TAO patients with a CAS ≥ 3 were classified as active, while patients with CAS < 3 were classified as inactive. The CT scan and inflammatory activity evaluation were performed on the same day. Two ophthalmologists with more than five years of experience in oculoplasty were blinded to patient information for the CT image analysis and clinical inflammatory activity evaluation. The two experts jointly reviewed the images to reach a consensus in case of a disagreement. Patients below age 18, with a previous history of orbital surgery, orbital tumor, blowout fracture, idiopathic orbital inflammation, those with IV steroid treatment or radiation therapy at the time of CT scan taken, and those with incomplete CT scans were excluded.

Slice selection

Each patient’s orbital CT had 80 to 400 image slices. However, only a few CT slices were selected from axial, coronal, and sagittal planes to improve TAO activity evaluation performance by avoiding diagnostic model confusion due to redundant information. First, we selected the slice with the largest lens in the axial plane (AX1), then slices 3 mm above (AX2) and below (AX3) and 7 mm above (AX4) and below (AX5) AX1. Next, the slice with the largest lacrimal gland was chosen (AX6). In the coronal plane, the slice exhibiting the largest eyeball was selected first (CO1). We then picked slices 1/2 and 2/3 of the distance between CO1 and the orbit exit (CO2, CO3). Next, slices with the largest eyeball in both eyes were selected from the sagittal plane (SA_L, SA_R). Eleven CT slices were selected for each subject: six axial, three coronal, and two sagittal plane slices.

Data preparation and processing

After slice selection, we performed the Hounsfield Unit windowing process for better structure identification. We used the Pydicom library’s Value of Interest Look Up Table (VOI LUT) function to convert the original CT pixel values into values ranging from 0 to 1. In this study, we set the Window Center to 0 and the Window Width to 200. Next, we segmented identifiable structures from the 11 CT slices, including the eyeball, four rectus muscles, the optic nerve, and the orbital fat with a few exceptions. Because the superior rectus and superior levator palpebrae muscles could not be reliably distinguished from each other, they were segmented together as a single muscle group, namely the levator-superior rectus (SR) complex. The oblique muscles were excluded as they are difficult to distinguish clearly in CT images. Since AX6 was selected to represent the lacrimal glands, only the lacrimal glands were segmented in AX6. In addition, we further segmented the upper eyelid from two sagittal slices due to its TAO relevance. Finally, we segmented the entire orbit in the CO3 slice because it was difficult to distinguish between each four rectus muscles and the optic nerve (Table 4). As a result, we acquired 78 segmentation images from eleven selected slices.

Table 4 Segmented structure in CT slices.

Full size table

Neural network model

This study defined three considerations to achieve the best TAO activity evaluation performance when devising the proposed NN. First, input slices were chosen from three planes instead of one to capitalize on the advantage of information from multiple views. Second, to exploit possible interactions among identifiable structures in the same slice, the proposed NN processed all segmented images from one slice through a single pipeline; each segmented image was encoded into a channel. Third, we accomplished a pipeline combination ablation study to avoid possible evaluation degradation. Figure 3 illustrates the proposed NN’s overall architecture consisting of pipeline heads, a bottleneck layer, and a model body.

Each CT slice head is an operation sequence constituting 7 × 7 convolution, batch normalization (BN), and ReLU layers and one Cross Stage Partial (CSP) Block. The standard convolution layer output values were calculated with ${y}_{i,j,k}={{w}_{k}}^{T}{x}_{i,j}+{b}_{k}$ where ${x}_{i,j}$ was the input value subset centered at $(i, j)$, ${y}_{i,j,k}$ was the output value at $(i, j)$ in the $k$th feature map, and ${w}_{k}$ and ${b}_{k}$ were the $k$th filter’s weight vector and bias, respectively. ReLU activation function was defined as $ReLU(x)=max(x, 0)$, and the max and average pooling operation output values were ${y}_{i,j,k}=a$ and ${y}_{i,j,k}=\frac{1}{|{x}_{i,j}|}{\sum }_{a\in {x}_{i,j,k}}a$ respectively, where ${x}_{i,j,k}$ was the input value subset centered at ($i, j)$ in the $k$th input feature map and ${y}_{i,j,k}$ is the output value at $(i, j)$ in the $k$th output feature map. The CSP Dense Block was a modified Dense Block that reused vast amounts of gradient information with an improved gradient duplication problem and promising performance^{35,36,37,38,39}. The Dense Block was a convolution block with multiple densely connected Dense layers⁴⁰.

The bottleneck layer extracted essential information by compressing head-generated feature maps. Each pipeline head outputs 64 feature maps, generating (number of input CT slices) × 64 feature maps. Next, the proposed NN compresses feature maps based on a 1 $\times $ 1 convolution. The bottleneck layer constitutes a BN layer, a ReLU layer, and a 1 $\times $ 1 convolution. Feature maps passing through the bottleneck were finally compressed into 64 channels. The rest is the model body that processed the compressed feature maps through three CSP Dense Blocks containing 12, 24, and 16 Dense layers. Next, spatial features were compressed into a one-dimensional vector by a 7 × 7 average pooling. The proposed NN concatenated age and sex data with this one-dimensional vector through one-hot encoding, which was considered a pipeline. Finally, the concatenated vector is fed into a fully connected layer and a sigmoid function to output class probabilities. The final active probability $y$ was calculated by

$$y=F(B\left({h}_{1}\left({x}_{1}^{1}, \dots , {x}_{1}^{{n}_{1}}\right), \dots , {h}_{m}\left({x}_{m}^{1}, \dots ,{x}_{m}^{{n}_{m}} \right)\right), c)$$

where ${x}_{i}^{{i}_{k}}$ was the $i$th selected slice’s $j$th segmented image, ${h}_{i}$ was $i$th head, $B$ was a composition function from the bottleneck layer to a 7 × 7 average pooling, $F$ was the fully connected layer, and $c$ was clinical data.

Ablation study

The proposed NN individually examines input data (11 slices, age and sex) using each pipeline; thus, disabling uninfluential pipelines can distinguish essential information for TAO activity evaluation. To achieve this, we devised a procedure that enables or disables the 13 pipelines by referring to validation performance and estimating the optimal pipeline set. First, the algorithm checks the 13 single-input pipeline performances by only enabling one pipeline at a time to determine the best single-input pipeline. Next, in addition to the best single-input pipeline in the previous stage, a single-input disabled pipeline is enabled to identify the best dual-input pipeline. The algorithm again enables a single-input disabled pipeline with the best dual-input pipeline. The algorithm repeats this procedure until it has considered all 13 pipelines. Finally, the algorithm outputs the final pipeline combination with the best evaluation performance.

Neural network evaluation

We employed two latest NNs in computer vision to verify the proposed NN’s performance: CSPDenseNet and ConvNeXt. Similar to the proposed NN, the two latest NNs have been slightly modified to receive size 224 $\times $ 224 $\times $ 78 CT slices, age, and sex for a fair comparison. We implemented these models with the Pytorch (1.10.1) library, and all experiments were conducted in a Geforce RTX 3090 24 GB environment. The hyper-parameters were set to a 32 batch size, 30 epochs, the AdamW optimizer, and a 1e-3 learning rate halved every ten epochs by a step-learning rate scheduler. We assessed the three NN performances on six metrics: area under the receiver operating characteristics (AUROC), accuracy, F1-score, sensitivity, specificity, and precision. For training and evaluation, 432 patients were divided into training, validation, and test sets at 0.7, 0.15, and 0.15 ratios extracted through stratified random sampling. The data split process, training, and test were repeated ten times. Furthermore, we conducted additional model comparison experiments based on the selected ten pipeline subsets with the same settings (environment, hyper-parameters, evaluation metrics, data split, repetition). As a result, we could discern essential CT slices out of the collective to aid medical expert interpretation.

Visual explanation method

Gradient-weighted Class Activation Mapping (Grad-CAM) assisted us in analyzing experimental results⁴¹ as it generates a visual description regarding the final NN’s decisions. First, Grad-CAM uses gradients for class probabilities to create a coarse score map highlighting essential decision locations. Specifically, a filter’s average gradient values of a specific layer are regarded as that feature map’s importance. Next, each feature map and layer importance are multiplied. Then, all multiplied feature maps are averaged along the channel axis to produce a coarse score map of each region. Since the model receives multiple input images in our visualization, we used the head's feature map and gradient for each slice visualization.

Statistical analysis

Descriptive statistics are shown as the mean ± standard deviation for continuous variables and the number and percentage for categorical variables. Measured continuous variable comparisons were analyzed using a pair-wise t-test. We used Python's open-source library, Scikit-learn, for all statistical analyses of our study; p-values $<$ 0.05 were considered statistically significant.

Data availability

Datasets used and analyzed during the current study are available from the corresponding author upon reasonable request.

References

Fang, S. et al. Insights into local orbital immunity: Evidence for the involvement of the Th17 cell pathway in thyroid-associated ophthalmopathy. J. Clin. Endocrinol. Metab. 104, 1697–1711. https://doi.org/10.1210/jc.2018-01626 (2019).
Article PubMed Google Scholar
Piantanida, E., Tanda, M. L., Lai, A., Sassi, L. & Bartalena, L. Prevalence and natural history of Graves’ orbitopathy in the XXI century. J. Endocrinol. Invest. 36, 444–449. https://doi.org/10.3275/8937 (2013).
Article CAS PubMed Google Scholar
Hoang, T. D., Stocker, D. J., Chou, E. L. & Burch, H. B. 2022 update on clinical management of graves disease and thyroid eye disease. Endocrinol. Metab. Clin. N. Am. 51, 287–304. https://doi.org/10.1016/j.ecl.2021.12.004 (2022).
Article Google Scholar
Mourits, M. P. et al. Clinical criteria for the assessment of disease activity in Graves’ ophthalmopathy: A novel approach. Br. J. Ophthalmol. 73, 639–644. https://doi.org/10.1136/bjo.73.8.639 (1989).
Article CAS PubMed PubMed Central Google Scholar
Dickinson, A. J. & Perros, P. Controversies in the clinical evaluation of active thyroid-associated orbitopathy: Use of a detailed protocol with comparative photographs for objective assessment. Clin. Endocrinol. 55, 283–303. https://doi.org/10.1046/j.1365-2265.2001.01349.x (2001).
Article CAS Google Scholar
Byun, J. S., Moon, N. J. & Lee, J. K. Quantitative analysis of orbital soft tissues on computed tomography to assess the activity of thyroid-associated orbitopathy. Graefes Arch. Clin. Exp. 255, 413–420. https://doi.org/10.1007/s00417-016-3538-0 (2017).
Article Google Scholar
Regensburg, N. I., Wiersinga, W. M., Berendschot, T. T., Saeed, P. & Mourits, M. P. Densities of orbital fat and extraocular muscles in graves orbitopathy patients and controls. Ophthalmic Plast. Reconstr. Surg. 27, 236–240. https://doi.org/10.1097/IOP.0b013e31820365d5 (2011).
Article PubMed Google Scholar
Lei, C. et al. Facial expression of patients with Graves’ orbitopathy. J. Endocrinol. Invest. https://doi.org/10.1007/s40618-023-02054-y (2023).
Article PubMed Google Scholar
Liu, N. et al. An eyelid parameters auto-measuring method based on 3D scanning. Displays 69, 102063 (2021).
Article Google Scholar
Song, X. et al. A clinical decision model based on machine learning for ptosis. BMC Ophthalmol. 21, 169. https://doi.org/10.1186/s12886-021-01923-5 (2021).
Article PubMed PubMed Central Google Scholar
Hamwood, J., Schmutz, B., Collins, M. J., Allenby, M. C. & Alonso-Caneiro, D. A deep learning method for automatic segmentation of the bony orbit in MRI and CT images. Sci. Rep. 11, 13693. https://doi.org/10.1038/s41598-021-93227-3 (2021).
Article CAS ADS PubMed PubMed Central Google Scholar
Li, L. et al. Deep convolutional neural networks for automatic detection of orbital blowout fractures. J. Craniofac. Surg. 31, 400–403. https://doi.org/10.1097/scs.0000000000006069 (2020).
Article PubMed Google Scholar
Song, X. et al. Artificial intelligence CT screening model for thyroid-associated ophthalmopathy and tests under clinical conditions. Int. J. Comput. Assist. Radiol. Surg. 16, 323–330. https://doi.org/10.1007/s11548-020-02281-1 (2021).
Article PubMed Google Scholar
Lin, C. et al. Detection of active and inactive phases of thyroid-associated ophthalmopathy using deep convolutional neural network. BMC Ophthalmol. 21, 39. https://doi.org/10.1186/s12886-020-01783-5 (2021).
Article PubMed PubMed Central Google Scholar
Perros, P., Crombie, A. L., Matthews, J. N. & Kendall-Taylor, P. Age and gender influence the severity of thyroid-associated ophthalmopathy: A study of 101 patients attending a combined thyroid-eye clinic. Clin. Endocrinol. 38, 367–372. https://doi.org/10.1111/j.1365-2265.1993.tb00516.x (1993).
Article CAS Google Scholar
Nahm, F. S. Receiver operating characteristic curve: Overview and practical use for clinicians. Korean J. Anesthesiol. 75, 25–36. https://doi.org/10.4097/kja.21209 (2022).
Article PubMed PubMed Central Google Scholar
Hanai, K. et al. Automated detection of enlarged extraocular muscle in Graves’ ophthalmopathy with computed tomography and deep neural network. Sci. Rep. 12, 16036. https://doi.org/10.1038/s41598-022-20279-4 (2022).
Article CAS PubMed PubMed Central Google Scholar
Nakagawa, J. et al. Utility of the deep learning technique for the diagnosis of orbital invasion on CT in patients with a nasal or sinonasal tumor. Cancer Imaging 22, 52. https://doi.org/10.1186/s40644-022-00492-0 (2022).
Article PubMed PubMed Central Google Scholar
Kendler, D. L., Lippa, J. & Rootman, J. The initial clinical characteristics of Graves’ orbitopathy vary with age and sex. Arch. Ophthalmol. 111, 197–201. https://doi.org/10.1001/archopht.1993.01090020051022 (1993).
Article CAS PubMed Google Scholar
Santosh, K. C. et al. Machine learning techniques for human age and gender identification based on teeth X-ray images. J. Healthc. Eng. 2022, 8302674. https://doi.org/10.1155/2022/8302674 (2022).
Article CAS PubMed PubMed Central Google Scholar
Bi, Y. et al. Prediction of gender from longitudinal MRI data via deep learning on adolescent data reveals unique patterns associated with brain structure and change over a two-year period. J. Neurosci. Methods 384, 109744. https://doi.org/10.1016/j.jneumeth.2022.109744 (2023).
Article PubMed Google Scholar
Huang, Z., Zhu, X., Ding, M. & Zhang, X. Medical image classification using a light-weighted hybrid neural network based on PCANet and DenseNet. IEEE Access 8, 24697–24712. https://doi.org/10.1109/ACCESS.2020.2971225 (2020).
Article Google Scholar
Chauhan, T., Palivela, H. & Tiwari, S. Optimization and fine-tuning of DenseNet model for classification of COVID-19 cases in medical imaging. Int. J. Inf. Manag. Data Insights 1, 100020. https://doi.org/10.1016/j.jjimei.2021.100020 (2021).
Article Google Scholar
Fu, X., Chen, C. & Li, D. Survival prediction of patients suffering from glioblastoma based on two-branch DenseNet using multi-channel features. Int. J. Comput. Assist. Radiol. Surg. 16, 207–217. https://doi.org/10.1007/s11548-021-02313-4 (2021).
Article PubMed Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778, https://doi.org/10.1109/CVPR.2016.90 (2016).
Sarwinda, D., Paradisa, R. H., Bustamam, A. & Anggia, P. Deep learning in image classification using residual network (ResNet) variants for detection of colorectal cancer. Procedia Comput. Sci. 179, 423–431. https://doi.org/10.1016/j.procs.2021.01.025 (2021).
Article Google Scholar
Abedalla, A., Abdullah, M., Al-Ayyoub, M. & Benkhelifa, E. Chest X-ray pneumothorax segmentation using U-Net with EfficientNet and ResNet architectures. PeerJ Comput. Sci. 7, e607. https://doi.org/10.7717/peerj-cs.607 (2021).
Article PubMed PubMed Central Google Scholar
Yoo, H., Han, S. & Chung, K. Diagnosis support model of cardiomegaly based on CNN using ResNet and explainable feature map. IEEE Access 9, 55802–55813. https://doi.org/10.1109/ACCESS.2021.3068597 (2021).
Article Google Scholar
Kim, I. T. & Choi, J. B. Normal range of exophthalmos values on orbit computerized tomography in Koreans. Ophthalmologica 215, 156–162. https://doi.org/10.1159/000050850 (2001).
Article CAS PubMed Google Scholar
Huh, J., Park, S. J. & Lee, J. K. Measurement of proptosis using computed tomography based three-dimensional reconstruction software in patients with Graves’ orbitopathy. Sci. Rep. 10, 14554. https://doi.org/10.1038/s41598-020-71098-4 (2020).
Article CAS ADS PubMed PubMed Central Google Scholar
Barrett, L., Glatt, H. J., Burde, R. M. & Gado, M. H. Optic nerve dysfunction in thyroid eye disease: CT. Radiology 167, 503–507. https://doi.org/10.1148/radiology.167.2.3357962 (1988).
Article CAS PubMed Google Scholar
Giaconi, J. A., Kazim, M., Rho, T. & Pfaff, C. CT scan evidence of dysthyroid optic neuropathy. Ophthalmic Plast. Reconstr. Surg. 18, 177–182. https://doi.org/10.1097/00002341-200205000-00005 (2002).
Article PubMed Google Scholar
Tamboli, D. A., Harris, M. A., Hogg, J. P., Realini, T. & Sivak-Callcott, J. A. Computed tomography dimensions of the lacrimal gland in normal Caucasian orbits. Ophthalmic Plast. Reconstr. Surg. 27, 453–456. https://doi.org/10.1097/IOP.0b013e31821e9f5d (2011).
Article PubMed Google Scholar
Bartley, G. B. & Gorman, C. A. Diagnostic criteria for Graves’ ophthalmopathy. Am. J. Ophthalmol. 119, 792–795. https://doi.org/10.1016/s0002-9394(14)72787-4 (1995).
Article CAS PubMed Google Scholar
Wang, C.-Y. et al. CSPNet: A new backbone that can enhance learning capability of CNN. in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 1571–1580, https://doi.org/10.1109/CVPRW50498.2020.00203 (2020).
Bochkovskiy, A., Wang, C.-Y. & Liao, H.-Y. M. Yolov4: Optimal Speed and Accuracy of Object Detection. arXiv:2004.10934 (2020).
Kong, J. et al. Multi-stream hybrid architecture based on cross-level fusion strategy for fine-grained crop species recognition in precision agriculture. Comput. Electron. Agric. 185, 106134. https://doi.org/10.1016/j.compag.2021.106134 (2021).
Article Google Scholar
Wang, C.-Y., Bochkovskiy, A. & Liao, H.-Y.M. Scaled-YOLOv4: Scaling cross stage partial network. in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 13024–13033, https://doi.org/10.1109/CVPR46437.2021.01283 (2021).
Kong, J. et al. A spatial feature-enhanced attention neural network with high-order pooling representation for application in pest and disease recognition. Agriculture 12, 500. https://doi.org/10.3390/agriculture12040500 (2022).
Article CAS Google Scholar
Huang, G., Liu, Z., Van Der Maaten, L. & Weinberger, K. Q. Densely connected convolutional networks. in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2261–2269, https://doi.org/10.1109/CVPR.2017.243 (2017).
Selvaraju, R. R. et al. Grad-CAM: visual explanations from deep networks via gradient-based localization. in 2017 IEEE International Conference on Computer Vision (ICCV), 618–626, https://doi.org/10.1109/ICCV.2017.74 (2017).

Download references

Acknowledgements

This research was supported by the National Research Foundation of Korea (NRF) Grant funded by the Korea government (MSIT) (NRF-2021R1A2C1011351) and Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korean government (MSIT) [2021-0-01341, Artificial Intelligence Graduate School Program (Chung-Ang University)]. The funding organization had no role in the design or conduct of this research.

Author information

Authors and Affiliations

Department of Artificial Intelligence, Chung-Ang University, Seoul, Korea
Jaesung Lee & Sanghyuck Lee
AI/ML Research Innovation Center, Chung-Ang University, Seoul, Korea
Jaesung Lee
Department of Ophthalmology, Chung-Ang University College of Medicine, Chung-Ang University Hospital, 102 Heukseok-Ro, Dongjak-Gu, Seoul, 06973, Korea
Won Jun Lee, Nam Ju Moon & Jeong Kyu Lee

Authors

Jaesung Lee
View author publications
Search author on:PubMed Google Scholar
Sanghyuck Lee
View author publications
Search author on:PubMed Google Scholar
Won Jun Lee
View author publications
Search author on:PubMed Google Scholar
Nam Ju Moon
View author publications
Search author on:PubMed Google Scholar
Jeong Kyu Lee
View author publications
Search author on:PubMed Google Scholar

Contributions

J.L. and J.K.L. designed the project, devised the main conceptual ideas, and analysed the data. S.H.L. and N.J.M. collected the data. J.L. and S.H.L. wrote the manuscript. W.J.L. and J.K.L. revised the manuscipt.

Corresponding author

Correspondence to Jeong Kyu Lee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lee, J., Lee, S., Lee, W.J. et al. Neural network application for assessing thyroid-associated orbitopathy activity using orbital computed tomography. Sci Rep 13, 13018 (2023). https://doi.org/10.1038/s41598-023-40331-1

Download citation

Received: 05 April 2023
Accepted: 08 August 2023
Published: 10 August 2023
DOI: https://doi.org/10.1038/s41598-023-40331-1