Extended Data Fig. 1: Overview of the data split and downstream analyses performed in this study. | Nature Medicine

Extended Data Fig. 1: Overview of the data split and downstream analyses performed in this study.

From: Prediction of recurrence risk in endometrial cancer with multimodal deep learning

Extended Data Fig. 1

One representative WSI per patient from an Formalin-Fixed Paraffin-Embedded (FFPE) block was included. 20% of cases meeting inclusion criteria were randomly held out for an internal test set (n = 353). The remaining 80% was used for five-cross validation (n = 1,408 patients). This training dataset was enriched with dropped WSIs of FIGO 2009 stage IV cases or those with missing outcome such as the TCGA-UCEC cohort21 for training with self-supervised learning (n = 1,862). Two cohorts were held out as external test sets, the UMCG external test set (n = 160) and the LUMC external test set (n = 151). The LUMC external test set contains up to three FFPE blocks per case. More details for training and data split are provided in Methods. Altogether, including the two training steps and all downstream analyses, this comprehensive analysis comprised data of 2,751 tumors of women. CT, chemotherapy.

Back to article page