Extended Data Fig. 1: Additional characteristics of the HTMCP discovery and extension cohorts.

a. Coding mutations per Mb in samples exhibiting low (≤ 0.4) and high (> 0.4) APOBEC signatures. b. Difference in PIK3CA expression by HIV status. c. Mutations in the top 20 most mutated epigenetic modifiers, ordered by frequency of alterations for the cohort (n = 118). APOBEC signature proportion and homologous recombination deficiency (HRD) scores are reported above. HIV status, age at diagnosis, tumor histology (”other” includes neuroendocrine and undifferentiated) and stage are also annotated. d. Comparison of mutation frequencies of the 12 SMGs in the discovery vs. extension cohorts. Boxplots in a and b represent the median, upper and lower quartiles of the distribution and whiskers represent the limits of the distribution (1.5-times interquartile range), and statistics were determined using two-sided Wilcoxon rank sum tests.