Background

Approximately 70% of oropharyngeal squamous cell carcinomas (OPSCC) are caused by infections with human papillomaviruses (HPV) [1]. In clinical routine, the expression of p16 serves as a surrogate parameter with high sensitivity and specificity for HPV [2, 3]. Studies have demonstrated that p16-positive OPSCC exhibit superior survival rates and better responses to radiotherapy compared to p16-negative OPSCC [4,5,6]. Consequently, the latest World Health Organization (WHO) classification for head and neck tumors and the 8th edition of the TNM Classification of Malignant Tumors distinguish p16-positive OPSCC as a separate disease entity [3, 7, 8]. Comprehensive genomic profiling of large head and neck cancer cohorts supports the distinct tumor biology of HPV-positive versus HPV-negative subtypes [9]. Given the comparatively favorable prognosis of p16-positive OPSCC and the severe side effects associated with radiotherapy, treatment de-escalation strategies of therapeutic regimens are under investigation [10, 11]. However, attempts to validate these strategies have not been successful to date [12]. It is worth noting that approximately 20% of HPV-positive OPSCC cases elicit poor outcomes, and currently, no biomarkers have been established to identify these high-risk cases [13].

In previous studies, our group and others validated tumor budding (TB) as a prognostic biomarker in HPV-negative head and neck squamous cell carcinomas (HNSCC) [14, 15]. High TB was associated with inferior disease-free and overall survival (OS) as well as increased metastatic spread to lymph nodes [16]. In contrast, studies on the prognostic implications of TB in HPV-positive tumors are scarce [17].

In a more recent analysis of the TCGA HNSCC cohort, we identified TB as a negative prognostic biomarker not only in HPV-negative tumors but also in the HPV-positive subgroup [18]. Another potential prognostic factor in HNSCC comprises the evaluation of the tumor-stroma-ratio (TSR) which seems to be of prognostic significance in head and neck cancer including tumors of the oropharynx [19, 20]. The prognostic significance of the TSR was especially shown for cancers of the oral cavity where stroma-rich tumors are associated with a reduced disease-specific and disease-free survival [19]. However, in a meta-analysis Almangush et al. detected some degree of heterogeneity in the determination of the disease-specific survival [19]. Furthermore, the prognostic significance of the TSR in other head and neck subsites (e.g., of the nasopharynx or larynx) is unclear due to a limited number of studies published so far [21].

Furthermore, evaluating tumor-infiltrating lymphocytes (TILs) might help in the prognostic stratification of OPSCC as high amounts of TILs are linked to improved survival outcomes in these tumors [22].

In our study, we aimed to validate the prognostic significance of TB and to probe the prognostic impact of other morphological parameters (TSR, TILs) in a large multicenter cohort of p16-positive HPV-associated OPSCC. We included n = 275 p16-positive OPSCC from five pathology departments in Germany (Friedrich-Alexander University Erlangen-Nuremberg, University of Heidelberg, University of Lübeck, Christian-Albrechts-University Kiel, Technical University of Munich), rendering the cohort the up to date largest cohort analyzing TB in HPV-associated OPSCC. Our retrospective study contributes to the identification of biomarkers that could guide therapy and, specifically, aid in selecting patients for treatment de-escalation in HPV-related OPSCC.

Material and methods

Patient cohort

Included in the study cohort were resection specimens of OPSCC whereas biopsies were not considered applicable. The initial study cohort consisted of 299 patients with p16-positive squamous cell carcinomas originating in the oropharynx. A total of n = 24 had to be excluded from the histomorphological analysis as e.g. too less invasive tumor for the evaluation of tumor budding was contained on the digitized slides or the slides contained a lymph node metastasis of the resection specimen and not the primary tumor.

The final study cohort consisted solely of resection specimens of n = 275 primary, treatment-naive OPSCC. Cases were enrolled from five pathology departments in Germany - the Friedrich-Alexander University Erlangen-Nuremberg, University of Heidelberg, University of Lübeck, Christian-Albrechts-University Kiel and the Technical University of Munich - rendering the study design a retrospective multicenter study. Resections were performed between 1991 and 2023 (median 2015, interquartile range 5 years). For routine diagnostic pathology purposes, formalin-fixed, paraffin embedded (FFPE) tumor tissue blocks and subsequently Hematoxylin-Eosin (H&E) stains were created. For our study, OPSCC patients were identified in pathology archives and were included in a consecutive manner. As p16-positivity is a widely accepted surrogate marker for HPV-association in case of HNSCC located within lymphoepithelial regions within the oropharynx [3, 23, 24], p16-staining was performed in each center individually and after evaluation of the stain, patients were enrolled based on presence of p16-positivity with p16-negative OPSCC counterparts being excluded.

In accordance with the guideline from the College of American Pathologists, moderate to strong p16-expression in at least 70% of tumor cells was regarded as positive (Fig. 1; [23]). The analysis of immunohistochemical p16 stains was performed locally in each university pathology department and evaluated on glass slides by board-certified pathologists with long experience in head and neck pathology named including affiliations in our author list.

Fig. 1: Microscopic illustration of a case with basaloid appearance and strong diffuse p16 expression.
figure 1

a Overview over a OPSCC in low magnification; upper image: H&E, lower image: p16 immunohistochemistry; scalebar in both images: 200 µm. b High magnification view of selected areas with tumor buds highlighted by arrows. (upper image: H&E, lower image: p16 immunohistochemistry; scalebar in bith images: 20 µm). OPSCC: oropharyngeal squamous cell carcinoma; H&E: Hematoxylin-Eosin stain.

Median follow-up periods were 48 months for OS and 44.3 months for progression-free survival (PFS). To facilitate the comparability of cases all cases were staged according to the 4th edition of the “WHO classification of Head and Neck Tumors” [3]. Detailed clinicopathological characteristics of the study cohort, which were collected separately in each university center in a joint clinico-pathological effort by accordingly qualified medical doctors as named including affiliations in our author list, are given in Supplementary Tables 1 and 2.

Histological analyses

For histological analyses, routine H&E stained tumor sections were digitized by scanning with 40x magnification and saved as *.svs file. One digitized H&E-stained slide per case with unambiguous squamous cell carcinoma morphology was provided by all five participating centers and was evaluated by two experienced pathologists (FS and MB) blinded to clinicopathological data. Cases were subtyped in analogy to the WHO classification for p16-negative HNSCC in conventional (keratinizing vs. non-keratinizing) and basaloid morphology (Fig. 1) [3]. In analogy with previous studies in HNSCC, TB was defined as the detachment of four or less tumor cells from the main tumor mass, which infiltrated the adjacent stroma [18, 25]. TB was counted irrespective of the ___location within the tumor, i.e., tumor center or periphery. We analyzed TB in 1 high-power field (HPF) with the highest amount of buds (“hotspot”) and ten consecutive HPF with 1 HPF covering an area of 97464 µm2 and corresponding to a field diameter of 0.35 mm in light microscopy.

The stroma content (evaluated as TSR) was assessed as the proportion of the tumor area covered by stroma with regard to the whole tumor area. Areas of necrosis were excluded from the assessment.

TILs evaluation was based on the assessment of stromal TILs. The stromal area covered by TILs (under the exclusion of intraepithelial TILs) was evaluated in relation to the whole tumor stroma [26].

Statistical analyses

For the morphological parameters TB, stroma content and immune infiltrate, “Cutoff Finder” was applied to identify the prognostically most relevant cutoffs [27].

Ordinally scaled data were compared with Fisher’s exact test, whereas continuous data were compared with the Mann–Whitney-U-Test. Mutual correlations were calculated with Spearman’s correlation coefficient. Survival outcomes between groups were analyzed with the Kaplan–Meier method and survival differences were calculated with the Log-Rank test. For univariate and multivariate survival analysis a Cox Regression model was applied.

All tests were calculated two-sided with p-values < 0.05 regarded as significant. For statistical analyses R (version 4.1.3) was used.

Results

Histological analyses

Detailed clinicopathological characteristics of the n = 275 patients comprising the study cohort are shown in Supplemental Table 1. All tumors were located in the oropharynx, with n = 95 (34.5%) originating within the tonsils, n = 17 (6.2%) originating from the base of the tongue, n = 2 (0.7%) originating from the lateral oropharyngeal wall, n = 1 (0.4%) originating from the soft palate and the remaining n = 160 (58.2%) were not assigned to a specific anatomic subsite.

Basic characteristics for TB (mean, median, minimum, maximum) in 1 HPF, TB in 10 HPF, stroma content and TILs are shown in Supplementary Table 2. A total of n = 134 tumors (48.7%) showed no tumor budding at all, whereas in n = 141 cases (51.3%) TB was observed in varying degrees.

To determine the cutoffs for each biomarker with the strongest prognostic impact on OS we applied an online tool for biomarker cutoff optimization “Cutoff Finder”, described in detail in ref. [27]. The optimal cutpoints to separate low-risk from high-risk cases were those with the lowest p-values in a subsequent log-rank test. The following cutpoints for high-risk cases for poor outcome were calculated: For TB in 1 HPF OPSCC with ≥4 buds showed significantly worse OS compared to counterparts with <4 buds, for TB in 10 HPF as well ≥4 buds provided an optimal discrimination and for TILs cases with ≥53% showed a significantly better OS. For the TSR no prognostically significant cutoff was identified.

All tested cutoffs for TB (ranging from 1 to 12) yielded a prognostically significant stratification (Supplementary Fig. 1).

As both the optimal cutoff value (4 buds) and the applied cutoff value (6 buds) showed similar hazard ratios we classified the cases according to TB in 10 HPF and applied the cutoffs identified in our previous publication on HNSCC to facilitate the comparability with our previous study [18]. To this end, high TB was defined as detection of at least 6 buds for the 10 HPF methods and as detection of at least 2 buds for the 1 HPF method. For high TILs the cutoff ≥53% was applied.

Clinicopathologic correlations

65 cases (23.6%) were classified as TB-high and 93 cases (33.8%) as TILs-high.

TB evaluated in 1 HPF and TB evaluated in 10 HPF were strongly correlated (Spearman’s rho = 0.98, p < 0.001). Analyzing 10 HPF cases with high TB were less often classified as pT1/2 and more often classified as UICC stage III/IV.

Neither TB in 1 HPF (Spearman’s rho = 0.14, p = 0.054) nor in 10 HPF (Spearman’s rho = 0.13, p = 0.068) were associated with the number of positive lymph nodes nor with the pN0/pN+ stage. For n = 43 patients, information regarding an extracapsular extension of a lymph node metastasis was available but we could not detect an association with TB.

Cases with high TILs were more often classified as pT1 or pT2, more often pN0 and more often UICC stage I/II.

The number of buds in 1/10 HPF showed a weak negative correlation with the amount of TILs (Spearman’s rho: −0.144/−0.137, p = 0.017/0.023) and a weak positive correlation with the stroma content (Spearman’s rho: 0.214/0.207, p < 0.001 each). No correlation between the amount of TILs and the stroma content was detected.

Clinicopathological correlations are summarized in Table 1.

Table 1 Clinicopathological characteristics of the cohort after stratification into TB low and TB high according to the cutoff for 10 HPF.

Survival analysis

In univariate survival analysis (OS) age, UICC stage IV, R1/R2-resection status, LVI and TB were associated with inferior outcomes while high levels of TILs were associated with a lower hazard ratio (Table 2, Supplementary Figs. 2, 3). Cases with high TB and LVI were also associated with inferior survival outcomes regarding PFS (Supplementary Table 3, Supplementary Figs. 2, 3). Median survival times for OS (PFS) for high TB were 116.0 (138.0) months. Median survival times for OS for absence of LVI were 256.0 months and 259.0 months for presence of LVI. For low TB (OS, PFS) and LVI (PFS) no median survival times could be calculated as the survival probability remained >0.5.

Table 2 Results for the univariate analysis are shown for TB assessment in 10 HPF and for overall survival.

Both the year of surgical treatment and the center of clinical care were not associated with survival outcomes.

In multivariate analysis age and sex were included along with the significant parameters of the univariate analysis (UICC stage, resection status, LVI, TILs, TB). For OS age, LVI and TB remained as independent prognostic factors (Table 3) whereas for PFS only LVI and TB remained as independent prognostic factors (Supplementary Table 4).

Table 3 Results for the multivariate analysis are shown for TB assessment in 10 HPF and for overall survival.

Combined classification of TB and LVI

Applying the combined prognostic stratification scheme composed of TB in 10 HPF and LVI we obtained three subgroups: a) Cases with low TB and absence of LVI (n = 172, 62.5%), b) cases with low TB and presence of LVI or cases with high TB and absence of LVI (n = 90, 32.7%) and c) cases with high TB and presence of LVI (n = 13, 4.7%).

In univariate analysis (OS) the combined prognostic stratification scheme yielded an intermediate risk for inferior outcomes for cases with either high TB or presence of LVI (HR: 2.18, 95% CI: 1.16–4.11, p = 0.016) and the highest risk for cases with both high TB and the presence of LVI (HR: 9.78, 95% CI: 3.99–24.01, p < 0.001, Fig. 2). Median survival times for OS (PFS) were 164.3 months for the intermediate-risk group and 28.1 (23.0) months for the high-risk group. For the low-risk group (OS and PFS) and the intermediate-risk group (PFS) the median survival times could not be calculated (survival probability remained >0.5).

Fig. 2: Kaplan–Meier plots for illustration of survival of patients depending on TB and LVI status.
figure 2

a Overall survival (b) Progression free survival. Patients were stratified according to TB in 10 HPF and the absence or presence of LVI. HPF High-power field, L0 Lymphovascular invasion absent, L1 Lymphovascular invasion present, LVI Lymphovascular invasion, TB Tumor budding.

In the multivariate analysis (OS), which included age, sex, UICC stage, resection status, and TILs, patient age and the combined classification were identified as independent prognostic factors. However, for PFS, only the combined classification (TB and LVI) remained as an independent prognostic factor (Table 4).

Table 4 Results for the multivariate analysis.

For a subset of n = 115 cases (41.8%) the exact anatomic localization of the tumor was available. According to Tham et al. tumors located in the tonsils or base of tongue (n = 112) were considered tonsillar related areas whereas the tumors located at the lateral oropharyngeal wall and soft palate (n = 3) were considered to arise in a non-tonsillar related area [28].

Subsequently analyzing the combined stratification scheme (TB + LVI) with regards to tonsillar and non-tonsillar related areas and overall survival we observed that the combined scheme can prognostically stratify tumors of the tonsillar related areas. Here, in univariate analysis cases with high TB/absence of LVI or presence of LVI/low TB were allocated to the intermediate risk group (HR: 1.62, 95% CI: 0.33–8.04, p = 0.557). Cases with high TB and presence of LVI were allocated to the high-risk group (HR: 16.54, 95% CI: 2.64–103.66, p = 0.003). Among tonsillar related tumors similar results were obtained for cases with high TB and presence of LVI in multivariate analysis of overall survival (including age, sex, UICC stage, resection status and the combined scheme of TB and LVI) (HR: 29.00, 95% CI: 2.45–342.96, p = 0.008; Supplementary Table 5). Median survival times for tonsillar related and nontonsillar related areas are shown in Supplementary Table 6.

For non-tonsillar related areas no prognostic stratification considering the combined stratification scheme (TB + LVI) could be calculated (only n = 3 cases).

Discussion

In 2017, the WHO classification of tumors of the head and neck separated HPV-associated OPSCC from HPV-independent counterparts [3] and this classification was adopted in the current WHO classification [24]. The rationale for this distinction is based on the evidence that these entities - HPV-positive versus HPV-negative OPSCC - exhibit differences in epidemiological, molecular, biological and prognostic characteristics [29,30,31]. Among HPV-positive OPSCC, a subgroup of approximately 20% shows a comparably poor prognosis linked to early cancer recurrences and metastasis [31, 32]. However, currently no established biomarker effectively distinguishes these high-risk cases from low-risk cases [31].

Aiming for improved patient stratification, we evaluated the feasibility of TB alongside other promising morphological parameters to identify low-risk and high-risk p16-positive OPSCC. Particularly in focus were our previously published TB grading scheme [18] as well as TIL. Furthermore, we evaluated TSR, which was shown to be a prognostic biomarker in HNSCC including OPSCC [19, 20]. After TB was identified as a tissue-based biomarker with significant prognostic value, we then aimed to improve the prognostic stratification by combining TB with LVI. The latter is a well-established biomarker which has been demonstrated to be of significant impact on survival. To this end we tested a combination of TB and LVI and were finally able to show that this combination revealed the highest prognostic significance, even stratifying the subgroup of tonsillar-related OPSCC.

Up to date only our own previous study analyzed TB in HPV-associated OPSCC [18]. In the respective study, we identified thresholds of 2 buds in 1 HPF and 6 buds in 10 HPF as prognostically most relevant. However, the study’s applicability on HPV-positive OPSCC was limited due to a relatively small number of patients with this type of cancer. The present study includes the largest number of p16-positive OPSCC patients to date. An extensive multicenter setting allowed us to reach this impressive number of patients which carries sufficient power for robust statistical analyses.

Applying the cutoffs 2 buds in 1 HPF and 6 buds in 10 HPF to our case series, we successfully stratified p16-positive OPSCC into low-risk and high-risk subgroups, observing significant differences in both OS and PFS in uni- and multivariate survival analysis for both, all OPSCC as well as the subgroup of OPSCC located within tonsillar related areas.

A topic of ongoing debate is the evaluation procedure of TB in HNSCC and squamous cell carcinomas in general. In colorectal cancer TB is evaluated in a 20x field (for further details see International Tumor Budding Consensus Conference [33]. In HNSCC (as well as in other squamous cell carcinoma entities including pulmonary squamous cell carcinoma), TB has been evaluated frequently in 1 HPF at 40x magnification (reviewed comprehensively in refs. [25, 34,35,36]). Consequently, we applied the latter HPF-method in our previous [18] as well as in our current study. To enable a transformation to digital evaluation, we calculated the size of 1 HPF in µm2, rendering an area of 97464 µm2being included in 1 HPF. Therefore, we believe that we are able to alleviate the comparability with other studies on TB in squamous cell carcinomas which preferably have applied this HPF method.

Cases with high TB were associated with poorer outcomes in univariate analysis and TB remained as an independent prognostic factor in multivariate analyses. Similar results were achieved when evaluating TB in both 1 and 10 HPF, indicating the applicability of TB assessment in both biopsy and resection specimens [37]. However, we recommend the 10 HPF method for evaluating TB if feasible as it has shown better reproducibility [33].

The negative prognostic impact of TB in p16-positive OPSCC aligns with similar findings in other cancer types, including HPV-negative HNSCC, colorectal cancer, lung cancer, breast cancer, and pancreatic cancer [25]. As numerous original studies and reviews have already demonstrated the prognostic value of TB in p16-negative HNSCC (reviewed amongst others in refs. [25, 35, 36, 38], we do not believe that an additional validation for this subtype is compatible with the design of our current study, which excludes p16-negative OPSCC. Furthermore, our results emphasize the prognostic value of TB in HNSCC: TB stands out as an easily applicable and cost-effective morphological parameter that may also be relevant for p16-positive OPSCC [39].

In our study, LVI emerged as another independent prognostic factor for worse outcomes regarding OS and PFS. Unlike TB, LVI is already considered an adverse factor in OPSCC, and its presence might lead to administering adjuvant radiotherapy [40, 41].

Taking into account that TB as well as LVI were significantly correlated with poor outcome, we probed to combine both in a combination scheme. Indeed, statistical analysis showed that the combination of TB and LVI is a significant prognosticator of patient outcome in uni- and multivariate analysis with even better capacity to stratify patients compared to each biomarker alone. Given the severe side effects associated with radio(chemo)therapy, several trials have explored de-intensification strategies in the treatment of p16-positive OPSCC [42]. From a clinical perspective, evaluating TB, particularly in conjunction with LVI, could help to distinguish low-risk p16-positive OPSCC patients who might benefit from less intensive treatment regimens from high-risk patients who may require the full spectrum of therapeutic options. The presence of TB, as a histomorphological indicator of aggressive tumor biology - particularly when co-occurring with LVI - may represent the morphological manifestation of a subset of p16-positive OPSCC associated with poor outcome. This subset shows comparably worse survival outcomes, despite the generally superior tumor biology characteristic of p16-positive OPSCC [25, 31, 43].

Tham et al. evaluated the influence of the anatomic subsite on survival in a large cohort of HPV-positive OPSCC and discriminated between tonsillar related areas and nontonsillar related areas. In our study the combined stratification scheme (TB and LVI) was able to identify a high-risk group of tonsillar related tumors (high TB and presence of LVI). However, no prognostic significance was reached for tumors with high TB and absence of LVI or low TB and presence of LVI. This might be attributable to the comparable small size of this subcohort with distinct anatomic subsites available. For nontonsillar related tumors no prognostic stratification of the combined scheme (TB and LVI) could be calculated as for this category only three cases could be assigned a distinct anatomic subsite.

Our results may pave the way to a patient stratification in analogy to colorectal cancer. Here, TB and LVI (amongst others) are combined in risk-scoring systems to identify patients with T1 carcinoma which need treatment escalation due to a high risk of metastasis and poor outcome [44]. This could be an approach for identification of high-risk cases amongst OPSCC patients as well.

Some authors have argued against conducting TB-scoring on HPV-associated HNSCC, positing that HPV-associated cancers typically do not exhibit TB, thus rendering TB scoring inappropriate [45]. However, in our analysis we included all p16-positive cases irrespective of their morphological characteristics. This approach is supported by literature suggesting that p16 is a reliable marker for an HPV-association in HNSCC located in the oropharynx [23, 24] and that there may be instances where HPV-associated cancers present with characteristics like a keratinizing phenotype [46, 47]. Although our analysis is not directly comparable to those involving molecularly confirmed HPV-positive OPSCC, our results nevertheless indicate a prognostically relevant stratification of p16-positive OPSCC by evaluating TB. Therefore, we believe that our study could provide relevant prognostic information for the treatment of patients with p16-positive OPSCC.

Interestingly, our study did not reveal an association between TB and the presence of extracapsular extension in lymph node metastasis. This lack of association might be due to the limited number of patients for whom data on extracapsular extension was available.

In our study, we did not observe any impact of TSR on the prognosis of p16-positive OPSCC. This finding is in contrast to a recent study by Almangush et al. [20] which demonstrated a poor outcome of HPV-associated OPSCC with high stromal content. However, most data on the prognostic impact of TSR in HNSCC is related to oral squamous cell carcinomas and data on the prognostic impact of TSR on other HNSCC subsites including the oropharynx is scarce [19, 21]. Taken together, the significance of TSR in OPSCC remains to be clarified in future studies.

However, our study has limitations. Current studies evaluating treatment de-escalation strategies for OPSCC typically consider factors such as an underlying HPV-association, tumor node metastasis (TNM) stage, and smoking status [48]. Due to the multicenter design of the study, additional molecular workups to confirm HPV-infection were not performed as access to formalin-fixed paraffin-embedded tumor tissue was not feasible; we only had access to digitized H&E-stained tissue sections from the five university centers. Likewise, our clinical data did not include information on patients’ smoking status or alcohol abuse. Thus, we were unable to directly assess whether TB/LVI is correlated with HPV-positivity based on RNA-or DNA-analysis, smoking status and/or alcohol abuse. One further drawback of our multicenter study design was that clinical data regarding the exact localization of the tumor within the oropharynx were not available for several cases. Thus, the prognostic significance of TB/LVI regarding nontonsillar related areas could not be determined. Therefore, additional studies specifically addressing these questions and examining if TB/LVI can provide valuable information in addition to the TNM stage in HPV-associated OPSCC are warranted.

Another limitation is the retrospective design of our study, with cases recruited over an extended period. This long timeframe may have led to variations in treatment courses due to changes in clinical practice. This limitation could be mitigated in future studies by evaluating TB/LVI in clinical trial populations treated more uniformly.

Nevertheless, we believe that our findings underscore the prognostic relevance of TB and LVI in p16-positive OPSCC and emphasize its potential utility in clinical practice.

Conclusion

In our evaluation of previously described cutoffs for TB, we were able to stratify p16-positive OPSCC with significant prognostic implications. In addition to TB, LVI emerged as another independent prognostic factor associated with inferior outcomes. Thus, morphological parameters such as TB and LVI, particularly when combined, might help in guiding clinical decision making regarding the most appropriate treatment regimen, whether it be intensified or de-intensified, for patients with OPSCC.