Introduction

Acute myeloid leukemia (AML) is a malignant clonal disorder of the hematopoietic system1. Approximately 50% of patients achieve complete remission with induction therapy, but overall survival remains <30% at five years. Further, AML patients who relapse or are refractory to chemotherapy have a dismal prognosis2. Immunotherapies have emerged as a promising therapeutic option in solid cancers, yet efficacy in AML has been disappointing in clinical trials3.

Leukemic cells have been shown to reprogram the surrounding bone marrow microenvironment (BMM) in the course of disease progression, modulating anti-tumor immune response and enabling immune evasion4,5. Filling the AML-immune cell interaction knowledge gap to uncover actionable therapeutics requires deeper insights into how cells within the BMM interact. The advent of high-parameter targeted spatial proteomics technologies6,7,8,9, which capture the spatial distribution of proteins and their expression, has allowed for a richer understanding of the tumor immune microenvironment. Non-spatial technologies such as single-cell RNA sequencing and flow cytometry disaggregate cells and are unable to resolve tissue structure, discarding critical organization of cell interactions such as tertiary lymphoid structures (TLS) which, as part of the adaptive immune response, have been shown to have prognostic implications in cancer through improved response to immune checkpoint therapy10 and increased lymphocyte infiltration11.

Sequential immunofluorescence (IF) is a novel technology that allows for the high-plex interrogation of the spatial proteome. However, the technique comes with unique challenges in formalin-fixed, paraffin-embedded (FFPE) bone marrow tissue: endogenous autofluorescence, tissue and bone deformation through repeated cycles of imaging, and the necessity of decalcification, which degrades morphology and immunoreactivity12.

Given these limitations, we developed a robust sample-to-analysis workflow to spatially capture the proteome of the BMM using an integrative combination of IF and imaging mass cytometry (IMC), maximizing the strengths of each modality and enabling spatial alignment at the single-cell level. Additionally, hematoxylin and eosin (H&E) staining on an adjacent slide captures structural components, granting a comprehensive view of the BMM.

We present an optimized workflow for the high-plex spatial proteomic imaging of leukemic BMM using the COMET sequential IF system to uncover the appearance of a TLS-like aggregate in AML-affected bone marrow and demonstrate the capability for discovering novel AML-BMM interactions with prognostic implications. Furthermore, we provide a resource of validated antibodies for use in bone marrow imaging. These strategies allow high-throughput multimodal spatial characterization and can be applied to other bone marrow diseases beyond AML.

Results

An optimized high-plex spatial proteomics workflow

We developed a robust sample-to-analysis workflow optimized for speed, flexibility, and accurate recapitulation of the bone marrow (Fig. 1). Seven bone marrow biopsies from seven adult AML patients were organized into two TMAs using an automated TMA builder for high-throughput, reduced staining and imaging costs, and reduced batch effect without compromising spatial relationships13. Compared to manual TMA generation, the automated TMA builder provided faster turnaround time and consistent spacing, thickness, and core height. We found a 1.5-mm core diameter to be ideal for tissue adhesion as smaller cores (≤1 mm) tended to detach under our heat-induced epitope retrieval (HIER) protocol (see Methods). The resulting TMA FFPE block was sectioned, mounted onto three standard glass slides for H&E staining, proteomic imaging using the COMET IF and Hyperion XTi IMC systems, and any additional assays. The block was then archived for future use. Section thickness was optimized at 4 µm based on the depth of focus of the COMET as well as signal-to-noise ratio for IF in FFPE tissues14.

Fig. 1: Overview of the multimodal bone marrow imaging pipeline.
figure 1

Patient bone marrow biopsies are ordered based on study suitability and then undergo rigorous quality screening. A digital automated approach for TMA building is used, which increases quality, reproducibility, and speed compared to manual TMA building. The analysis workflow begins by aligning all modalities to a common reference. Imaging data is decomposed into cells by cell segmentation with the U-Net convolutional neural network. Clinical and cell features are aggregated, allowing for comprehensive high-resolution analysis on spatial neighborhoods, structural proximity, and marker enrichment. Figure created in Biorender.

Spatial immunofluorescence in FFPE bone marrow reliably captures the BMM

Using our optimized protocol, we spatially interrogated trephine bone marrows from 7 AML patients (mean age = 67.9 years; range, 62.8–72.55; 43% male) pre- and post-AML directed therapy (Fig. 2a, characteristics detailed in Table 1, mutational groups Supplementary Fig. 1). This analysis involved 16 TMA cores distributed across two TMAs using the COMET approach. A total of 28 antibodies, which included canonical markers for immune cells (n = 12), AML cells (n = 4), and functional markers (n = 12), were applied to each TMA (Supplementary Table 1). To overcome autofluorescence emerging from lipofuscin, red blood cells, and extracellular matrix components such as collagen and elastin15,16, we devised an algorithm to optimally subtract the autofluorescence from the imaged tissue. Briefly, we utilized the autofluorescence quencher TrueBlack and manually adjusted the magnitude of the subtraction by the average endogenous autofluorescence per cycle to obtain a scaled subtraction effect, which recovered lost signal while minimizing noise (Supplementary Fig. 2a-f).

Fig. 2: Immunofluorescent profiling of the bone marrow corresponds with clinical spectral flow.
figure 2

a Oncoprint of the cytogenetics, response, and mutational characteristics of patients 1–7. b Stacked bar plot showing cell type densities by TMA core. c Immunophenotype of AML cells by flow aligns with marker expression in spatial IF. d AML blasts measured by IF significantly correlate with reported blast percentages by flow cytometry (Pearson correlation, R = 0.82, p = 0.00011).

Table 1 Patient characteristics

Out of the segmented 116,488 cells, 63,738 (54.7%) cells had at least one marker that allowed classification/identification (Supplementary Table 2). The remaining cells likely constituted stromal cells, mesenchymal cells, or other immune subsets (such as plasma cells). Protein expression for each cell type is shown in Supplementary Fig. S3a. Cell types were visualized using dimensionality reduction techniques to identify each cell type (Supplementary Fig. 3b, c). Cell densities for each cell type remained consistent within each patient between cores (Fig. 2b). However, distinct molecular and phenotypic differences emerged across patients with similar disease characteristics. TP53 mutations were in three out of seven patients and were associated with abnormal karyotype and partial HLA-DR expression. In contrast, both patients with normal karyotype profile exhibited a CD56(+) AML immunophenotype, despite CD56 expression being commonly associated with the t(8;21) translocation17. Both patients were also the only carriers of the NPM1 mutation in our cohort. No associations with cytogenetics or other mutational profiles were found. AML immunophenotype was also consistent with expression by flow cytometry (Fig. 2c), and blast percentage significantly correlated (R = 0.82, p = 0.00011) with flow cytometry done as part of the clinical diagnostic work-up (Fig. 2d). Overall, spatial IF was able to reliably capture cell populations within the BMM of AML patients. These findings suggest that while there is consistency in cellular density within patients, variations in molecular profiles, such as TP53 mutations and karyotypic abnormalities, may influence immunophenotypic features across patients with comparable disease presentations.

Non-destructive sequential IF allows for proteomic validation using the same slide

To further explore other modalities to guide in the identification of the AML-immune microenvironment, we applied spatial IMC, which uses metal-labeled tagged proteins to avoid spectral overlap and image multiple analytes simultaneously. We applied a panel of 25 markers (Supplementary Table 3) on the same slide that we previously performed IF imaging, taking advantage of the non-destructive nature of the COMET to expand the panel and validate the findings of the COMET.

We first tested whether imaging the tissue with sequential IF would degrade IMC staining. We selected two of the previously COMET-imaged TMA cores (TMA7-3C and TMA7-4C) and compared the staining when subsequently imaged on the same slide using IMC. Antibody staining was highly comparable, and markers targeting the same antigen colocalized well, showcasing retained antigenicity and robustness of alignment (Fig. 3a). Correlation analysis showed markers with high signal to background such as CD163, CD34, and Ki67 tended to cluster closely together with their imaging counterpart, as well markers with expected colocalization such as CD3, CD8, and CD45 (Fig. 3b). Nuclear staining showed some degradation between modalities and affected cell segmentation, leading us to use the 4′,6-diamidino-2-phenylindole (DAPI) staining for nuclear segmentation going forward (Supplementary Fig. 4a, b).

Fig. 3: Spatial IF to IMC results in degradation of nuclear signal in IMC without compromising antigen signal.
figure 3

a Comparison of CD34 (pseudocolored cyan), CD4 (pseudocolored green), CD8 (pseudocolored red), and CD163 (pseudocolored orange) staining performance in IF and IMC shows high concordance, validating cell segmentation and spatial alignment approaches. Scale bar, 25 µm. b Correlation of the same slide imaging between IF and IMC markers. c Alluvial plot showing proportions of concordance and discordance of the same cells phenotyped using only IMC markers and using only IF markers. d Distance of AML cells to granzyme K-negative and granzyme K-positive cells. Connected dots represent the distances for the same AML blast cell (paired Wilcoxon test, p < 2.2e-16).

To further evaluate IF to IMC performance, we separately phenotyped the cells iteratively captured from IF and IMC to compare how cell assignment differed (Supplementary Table 4). Canonical markers retained expected expression across cell types in both modalities (Supplementary Fig. 4a). Identities of CD34(+) blasts, and CD8(+) and CD4(+) T cells were generally concordant, with IMC displaying higher specificity (82%, 79%, and 71% concordance of IMC positive populations with IF, respectively) than IF (60%, 52%, and 40% concordance) (Fig. 3c).

Monocyte/macrophage markers were more discordant, with less than 50% of cells retaining phenotypic assignment between modalities (IMC 47% concordance, IF 41% concordance). Since macrophages have large, irregular membranes that often overlap with other cells, the phenotyping is more sensitive to differences in intensity thresholds, cell segmentation, and alignment. This issue is exacerbated in nucleus-based cell segmentation, which captures the morphology of small, round cells like lymphocytes better than large, irregular, or multinucleated cells. Overall, results on IMC were able to validate our findings in IF, and we found concordance between the two modalities high enough to allow for possible panel expansion using this method in the future.

With subsequent IMC staining, we were able to image additional markers without increasing COMET cycle count. We previously published that a subset of stem-like/memory CD8 T cells expressing granzyme K (GZMK) was enriched in responders to immunotherapy in AML18. We hypothesized that AML cells could have a spatial relationship with GZMK(+) CD8(+) T cells. Using the expanded panel provided by IMC, we found that CD34(+) AML cells were significantly closer to GZMK(-) CD8(+) T cells than GZMK(+) CD8 T(+) cells (Fig. 3d). This could possibly indicate an immune-evasive response to T cells with memory-like properties, which are more likely to trigger bystander activation in inflammatory environments19.

Spatial region analysis unveils tertiary lymphoid-like aggregates in adult AML

Recent developments in spatial neighborhood analysis have allowed for the identification of regions based on patterns of local cell proximity, identifying novel areas of interaction and structure with potential prognostic/therapeutic implications20. To test this in AML, we applied unbiased clustering over Ripley’s K function, a statistical measure of the spatial dispersion of each cell type modeled as a local point process. This revealed five distinct regions distinguished by different proportions of cell types, suggesting unique cell-cell interactions (Fig. 4a, b, Supplementary Fig. 5). Region_1 was highly enriched in AML blasts and significantly correlated to flow cytometry blast percentage (R = 0.92, p = 3e-7) (Supplementary Fig. 6a). Region_2 was primarily enriched in lymphocytes and cells of monocyte and macrophage lineage. Region_3 had a high concentration of B cells, CD4(+) T cells, CD8(+) T cells, and NK cells. Region_4 was an even mix of cell types, and region_5 captured mainly unidentified cells.

Fig. 4: Spatial neighborhoods reveal tertiary lymphoid-like aggregates.
figure 4

a Unbiased clustering of the spatial K function for each cell type reveals five different cellular regions of varying proportions per tissue core. Of note, region_3 only appears in three cores. b Relative enrichment of cell types for each cellular region. c Region assignment of cells within the TLS-like structures, with corresponding micrographs. CD34(+) AML cells are shown in pseudocolored cyan, CD4(+) T cells in pseudocolored green, and CD20(+) B cells in pseudocolored red. DAPI (pseudocolored blue) was used as a nuclear counterstain. Scale bar, 100 µm.

Region_4 was the largest in area, as expected from the unstructured liquid formation of the BMM. Interestingly, region_3 only appeared in tumor cores of 3/7 patients (TMA7-3C, TMA8-3C, and TMA8-5E) and consisted of a tight cluster of B and T lymphocytes (Fig. 4c). These aggregates varied in size and organization, with TMA8-3C having the largest aggregate and rough organization of a B/CD4 T cell core and CD4/CD8 T cell border, while TMA8-3C and TMA8-5E were much smaller and contained a thoroughly mixed B and T cell population. This finding was consistent with various states of maturity of TLS found in solid tumors based on composition and organization. B cells in region_3 also displayed significantly higher expression of Ki67 than in region_2 (p = 0.047) and region_4 (p = 1.46e-4), possibly indicative of a proliferative B-cell core, a previously reported characteristic of TLS in solid cancers (Supplementary Fig. 6b). We then stratified patients by presence of region_3 aggregates. Looking at differences between cell densities, we were surprised to find that patients with region_3 aggregates had significantly higher densities of monocyte and macrophage lineage cells (p = 0.0164), and while T and B cells were slightly higher in density, they were not significantly so (Fig. 5a). While overall densities were similar, patients with region_3 aggregates exhibited a significantly higher percentage of CD4(+) (p = 0.017) and CD8(+) T cells (p = 0.041) in the AML-enriched region_1, analogous to increased tumor lymphocyte infiltration in solid tumors (Fig. 5b). We then were able to validate the presence of these aggregates through the 7-color (Supplementary Table 5) Opal multiplex IF assay by staining subsequent sections of the original TMA blocks (Fig. 5c, Supplementary Fig. 6c). We also tested alternate methods for neighborhood detection and found similar results, with less specificity for region_3 likely due to focusing on local spatial patterns (Supplementary Figs. 79)8,20,21. TLS presence in AML remains poorly studied, with most research focusing on solid tumors. Recently, a study has described these clusters of lymphocytes as “TLS-like aggregates” in a pediatric AML cohort22. As such, we chose to adapt this terminology to reflect the lack of conclusive definitions of TLS in AML. In summary, by clustering local measures of spatial association, we were able to spatially identify a cluster of cells, denoted by region_3, of lymphocyte aggregates that resemble various stages of TLS formation and maturity in AML. These aggregates were further validated by the Opal assay.

Fig. 5: Spatial analysis reveals microenvironmental differences with lymphocyte aggregates.
figure 5

a Cell densities for B cells, CD4(+) T cells (CD4T), CD8(+) T cells (CD8T), and monocyte-macrophage lineage cells (MonoMacLin). Each dot represents an individual TMA core (Wilcoxon test). b Percent of B cells, CD8(+) T cells, and CD4(+) T cells in region_1 (AML-enriched) areas between TMA cores with and without region_3 aggregates. Dots represent an individual TMA core (Wilcoxon test). c Micrographs showing COMET stain and Opal validation stain for the region_3 aggregate of TMA8-3C. The two methods showed similar staining patterns for CD4 (pseudocolored green), CD8 (pseudocolored red), and CD34 (pseudocolored cyan). DAPI (pseudocolored blue) was used as a nuclear counterstain. Scale bar, 100 µm.

TLS signatures identified in Bulk RNA-seq and Spatial transcriptomics AML datasets are correlated with prognosis

Though we were able to discover TLS-like aggregates with the COMET, spatial proteomics is relatively low-plex compared to RNA-seq, and our panel excluded many functional TLS markers. Using 6 published gene signatures for TLS23,24,25,26,27,28 (Supplementary Table 6), we investigated the presence of TLS in AML using the Visium (10x Genomics, Pleasanton, CA) spatial transcriptomics platform, which can capture the transcriptome of the AML BMM at a resolution of 55 µm spots. We focused on the signature from Cabrita et al. as it is derived from a compendium of TLS-associated genes23 and not cancer-specific. We performed the Visium assay on an archived bone marrow sample with a positive lymphocyte aggregate by H&E stain. Using the local Lee’s L bivariate spatial statistic29, we found highly concordant spatial enrichment of the TLS signature by Cabrita et al. with deconvolution signatures of T cells and B cells, further suggesting TLS-like properties for these lymphocyte aggregates in AML30 (Fig. 6a).

Fig. 6: TLS signatures show prognostic value in transcriptomics.
figure 6

a Spatial transcriptomics of Cabrita et al. TLS signature shown with the Local Lee’s L spatial statistic when using a B cell signature as a covariate, and when using a CD4 effector T cell signature as a covariate, along with the correlation between signatures in the TLS-like ROI (Pearson correlation test). b Kaplan–Meier survival curves of AML patients from TCGA, Beat-AML, and MDACC by median score of the Hou et al. TLS signature. c Kaplan–Meier survival curves of AML patients from TCGA, Beat-AML, and MDACC by median score of the Cabrita et al. TLS signature. d Kaplan–Meier survival curves of AML patients from TCGA, Beat-AML, and MDACC by median HLA-E expression. e Correlation plot of Cabrita et al. TLS signature and significantly correlated Hallmark signatures (Pearson correlation test).

We further investigated the presence of TLS in AML using a bulk RNA-seq dataset integrating data from TCGA31, MDACC32, and BEAT-AML1/233 datasets and found signatures published by Cabrita et al. and Hou et al. to be predictive of overall survival (Fig. 6b, c). Additionally, in the 3/4 of the datasets, all 6 signatures significantly correlated with HLA-E expression, which we previously identified as a marker for immune evasion in AML34, and was significantly indicative of survival in the combined bulk RNA dataset (Supplementary Fig. 10a and Fig. 6d). The TLS signature also significantly correlated with hallmark scores for hypoxia, interferon-alpha (IFN-α) response, interferon-gamma (IFN-γ) response, inflammatory response, and angiogenesis in our AML dataset (Fig. 6e). When looking across the entire TCGA dataset, median TLS score for AML was relatively low (−0.45, rank 24/32), suggesting rarer or subdued TLS signaling programs in AML (Supplementary Fig. 10b). Diffuse large B-cell lymphoma had the highest median TLS score (0.76), possibly due to aberrant CCR7 expression in malignant B cells35.

Discussion

Uncovering AML-BMM interactions is essential to untangling the complex interplay between leukemia and immunotherapy. Comprehensive characterization of the BMM is necessary to determine cell states and interactions, and requires targeting of tumor, immune, and structural components to uncover deeper spatial characteristics.

AML is a heterogeneous disease, with a diverse range of phenotypic markers that contain a high overlap with normal hematopoietic cells. Currently, no singular marker can confidently phenotype AML blast cells, and clinical diagnostic flow cytometry panels for AML exceed 20+ markers to differentiate AML immunophenotypes, which carry diagnostic and prognostic implications2. Other groups have combined multiple targeted spatial imaging modalities for same slide imaging to improve cell segmentation or augment cell identification and analysis36,37,38. We employed IMC to validate our findings in IF, with the additional benefit of expanding the panel size without subjecting the tissue to additional imaging cycles. By using an adjacent H&E-stained slide as reference, we can align to a minimally treated image that reduces tissue damage or bone detachment from antigen retrieval and multiple staining rounds39. This is particularly important in cyclic imaging technologies, which require morphologic similarity at the micron level over multiple captures for accurate image registration, cell segmentation, and background subtraction40. Furthermore, lack of autofluorescence in IMC makes it ideal for targeting cells with high lipofuscin or pigment40, while high-resolution and lack of conjugation in COMET make it suitable for rapidly querying biomarkers of interest and interrogating subcellular expression6. Additionally, other structural stains relevant to bone marrow pathology, such as trichrome and silver stains, can be performed. Overall, our study finds sequential IF and IMC reliable methods for interrogating the BMM, with good concordance after alignment.

By clustering the spatial K function of each cell, we were able to reveal B and T cell aggregates that resembled various maturation states of TLS in solid tumors41. Current definitions of TLS are variable, and several delineations of TLS maturity and development have been proposed, with the most immature state being a loose lymphocyte aggregate while the most mature feature an organized cluster of lymphocytes with a germinal center, high endothelial venules, T follicular helper cells, and follicular dendritic cells41. Lymphocyte aggregates have long been observed in bone marrow, being described on a spectrum from disorganized clusters to structures resembling classical TLS. Their effect on the AML BMM however has not been thoroughly detailed42,43. These “TLS-like aggregates” were recently observed in a pediatric AML population and found to be increased in number after immunotherapy22. Interestingly, all of the patients in our study exhibiting TLS-like aggregates (n = 3) were positive for TP53 mutation, while their cohort was enriched for KMT2A rearrangement. Tumor protein p53 (TP53) is a well-known cancer driver that has been associated with TLS signatures in several cancers and corresponds with poor prognosis in AML44,45. TLS is thought to be formed under chronic inflammatory conditions46, and similarly, AML progression and pathogenesis have been linked to inflammation47. Our previous work implicated HLA-E as an immune evasion reaction of AML cells upregulated in response to inflammatory cytokines such as IFN-γ34. This finding is corroborated by the strong correlation between HLA-E expression and a validated TLS score in a large, integrated public AML dataset. Our research shows that the transcriptomic signature of these TLS-like aggregates can be observed spatially and corresponds with better survival, similar to that of TLS in solid cancers25,48,49. Though in the present study we lack the ability to detect features of mature TLS, our integration of multiple spatial proteomics, spatial transcriptomics, and bulk RNA assays suggests that the presence of bona-fide TLS may exist in AML.

Currently, many questions remain unanswered about the state of TLS-like and lymphocyte aggregates in AML. Lymphocyte aggregates in bone marrow are relatively rare in occurrence, estimated to be between 1 and 13%50. Even more rare, Agbay et al. reported less than 0.02% incidence of reactive germinal centers in 205,274 bone marrow specimens, and none in the context of AML51. In contrast, TLS is reported to occur in up to 60% of breast cancer cases52. Several models of TLS maturation suggest lymphocyte aggregates as the initial steps to TLS formation41,53. Though only three aggregates were found in the present study, only one had characteristics resembling a mature TLS. As previous studies have shown that only mature TLS exhibits favorable prognostic value44,48, understanding mechanism of TLS formation and maturation in AML compared to other cancers is crucial. Currently, the value of inducing TLS remains uncertain. The stimulator of interferon genes (STING) agonist ADU S-100 has been reported to promote TLS formation through lymphotoxin-alpha (LTα), interleukin-36 beta (IL-36β), and tumor necrosis factor-alpha (TNF-α) in a melanoma mouse model54, however, clinical trials of STING agonists have had disappointing results55 with companies halting development on STING agonists GSK3745417, MK-1454, and ADU-S-100. Additional areas of investigation include prognostic features of TLS, which include TLS maturity, TLS density, and intra- or peri-tumor ___location in solid tumors56. Though not all these features are directly applicable to liquid malignancies, possible analogs in the AML BMM could include size, proximity, and density of TLS around AML-enriched spatial neighborhoods. Given the potential impact on immune response and scarcity of research on TLS in AML, further study into these TLS-like aggregates, their factors of formation and maturation, their interaction with the BMM, and their effect on immunotherapy is warranted.

In summary, we were able to characterize a TLS-like aggregate in AML patients through an optimized, flexible, and cost-effective pipeline performant enough to spatially interrogate the leukemic BMM at the single-cell level using the same slide IF and IMC. We were able to further validate the presence of TLS-like aggregates using established TLS signatures in spatial transcriptomics and public AML datasets and show correlations to prognostic impact. Further research into TLS-like aggregates in the BMM could help unveil spatial factors of durable immune response in AML.

Methods

TMA preparation

16 bone marrow biopsies from 7 patients were retrieved from the Department of Hematopathology archives at the University of Texas MD Anderson Cancer Center and organized into two tissue microarrays (TMAs). Patients gave informed consent for sample collection. The study protocol was approved by MD Anderson’s Institutional Review Board and was conducted in accordance with the principles of the Declaration of Helsinki. Regions of interest (ROIs) for each sample were annotated using the TMA control software to select for focal areas of cellularity (3DHISTECH, Budapest, Hungary). A FFPE TMA block was created by embedding a 1.5-mm diameter circular core from each donor block with 0.5 mm spacing using the TMA Grand Master (3DHISTECH), which was sectioned at 4 µm to create a series of TMA slides.

COMET multiplex IF

FFPE TMA slides underwent dewax and hydration largely as previously described57. Slides then underwent HIER at 107 °C for 15 minutes. For testing of the Lunaphore protocol, dewax and antigen retrieval were performed using the PT module (Epredia, Kalamazoo, MI) as previously described6. For testing of the TrueBlack, TrueBlack lipofuscin autofluorescence quencher (Biotium, Fremont, CA) was applied according to manufacturer directions. The slide was then loaded into the COMET to fit in the 9×9 mm square imaging window. Staining, imaging, and elution were performed largely as previously described6. Primary antibodies and dilutions are found in Supplementary Table 1. Imaged slides were stored in PBS at 4 °C until ready to stain for IMC.

Opal multiplex IF

FFPE TMA slides from the same blocks were baked, dewaxed, and treated with 3% peroxide. Tissue then underwent the 7-color Opal multiplex staining workflow. In short, slides underwent HIER, blocked, and incubated with primary antibodies. Slides were then incubated with horseradish peroxidase, then Opal fluorophores, stripping out the antibodies between each marker before counterstaining with DAPI and mounting with a coverslip. All steps were performed on the NanoVIP 100 automated stainer (BioGenex, Fremont, CA). Slides were then imaged at ×40 using the PhenoImager HT 2.0 (Akoya Biosciences, Marlborough, MA).

IMC staining and imaging

Briefly, the TMA slide was treated with blocking buffer and permeabilization reagents, incubated in a cocktail of metal-conjugated antibodies, and counter-stained with iridium intercalator (201192B; Standard BioTools, San Francisco, CA). After drying, ROIs were drawn to select COMET-imaged cores, which were imaged at ×10 using the Hyperion XTi (Standard BioTools). The resulting MCD files were converted into TIFF by MCD Viewer (v1.0.560.6; Standard BioTools).

Image analysis

Image files were analyzed using the Visiopharm image analysis software (v2023.09; Visiopharm, Hoersholm, Denmark). COMET IF, IMC, and H&E images were aligned using the TissueAlign function guided by major tissue landmarks and coexpression of markers across assays. Detection of hematopoietic tissue and bone tissue was performed using a random forest classifier on the DAPI stain and the H&E stain, respectively. Areas with necrosis, crushed cells, poor focus, misaligned stitching, or high autofluorescence were removed. COMET image background subtraction was performed by subtracting the autofluorescence captures scaled by the average autofluorescence of previous runs per channel from the marker image. Cell segmentation was achieved by detecting cell nuclei using DAPI or iridium counterstains with a deep learning algorithm pre-trained on a U-Net convolutional neural network. Cells were classified using a supervised gating strategy based on mean intensity thresholds applied to canonical markers, while leukemic blasts were annotated using patient-specific immunophenotypes obtained by flow cytometry. For validation using the Opal staining workflow, cores with lymphocyte aggregates in the COMET assay were selected for comparison of staining. Cell labels, mean fluorescence intensities, and XY positions were exported.

Visium

Briefly, FFPE bone marrow samples were processed using the standard CytAssist protocol from 10x Genomics. SpaceRanger (v2.0) was used to align sequences to the reference genome and to map spots to the tissue. Spots were filtered by median absolute deviation, normalized using SCTransform(), and variable genes were identified using FindVariableFeatures() in Seurat58. Cell deconvolution used the Seurat transfer learning pipeline, leveraging in-house single-cell RNA-seq data. TLS signatures were scored using the AUCell R package59. Local Lee’s L was computed by adapting methods from the Voyager R package30. All software used is listed in Supplementary Table 7.

Bulk data

RNA-seq data and corresponding clinical data from TCGA, Beat-AML, and MDACC were downloaded and integrated into one dataset. Batch effects were corrected using ComBat-seq60. TLS signatures obtained from the literature were used to score the data using the R package GSVA61.

Statistical analysis

All statistical analyses were performed in the R statistical software (v4.3.0) using the rstatix R package. Marker expression was arcsinh-transformed, batch-corrected with Harmony62, and analyzed using the imcRtools package63. Spatial regions were generated with the lisaClust package64. Cell distances were calculated using the minDistToCells() function. A two-sided Wilcoxon rank sum test was used for comparisons between two independent groups, while the paired Wilcoxon test was used for paired data. Survival curves were created using the Kaplan–Meier method and assessed using the log-rank test. Pearson’s correlation was used for all correlative tests. All statistical tests used an alpha of 0.05.

Data visualization

The pipeline workflow was created in BioRender (BioRender, Toronto, Canada). Figures were constructed using the ComplexHeatmap65, pheatmap, ggplot2, ggpubr, and dittoSeq66 R packages and composed in Illustrator (v28.1; Adobe, San Jose, CA). All boxplots use median for the center line. Micrographs were created by cropping original image files, then layering and applying pseudocolor in Photoshop (v25.5.1; Adobe) through linear mapping. Whole image brightness adjustments (levels) were made for visibility.