Extended Data Fig. 5: Age-related changes in A2/M158+CD8+ CDR3αβ repertoire and probability of generation.

Distribution of CDR3α (a) and CDR3β (b) amino acid lengths, calculated using the Chothia nomenclature, across all age groups. Average-linkage dendrograms of TCR clustering for the TCRα (c) and TCRβ (d) A2/M158+CD8+ repertoires, generated by TCRdist. Each clustering was generated using a fixed-distance threshold algorithm and is colored by generation probability, that is, the ease of TCR recombination (red, highest; blue, lowest). Generation probabilities are shown relative to one another across TCRs from the different age groups. TCR logos for selected subsets (corresponding to the branches of the dendrogram enclosed in dashed boxes) are shown, labeled by cluster size both to the left of each logo and to the right of the corresponding branches. Each TCR logo depicts the V- and J-gene frequencies, the CDR3 amino acid sequence, and the inferred rearrangement structure of the grouped receptors (colored by source region: light gray, V; dark gray, J; black, D; red, N-insertions).