Extended Data Fig. 5: GC content estimated in the DH1 v2 and v3 genome assemblies.

a) GC content in the v2, v3 genomes and in the newly assembled sequences in the v3 genome. b) GC content in the genes predicted in the v3 genome and in newly assembled sequences. Note, for each fraction of genome and genes evaluated in this analysis (for example v2 genome, v3 genes) the frequency of bins for each GC level (1% GC windows) was rescaled independently setting the minimum number of bins to 0 and maximum number of bins to 100, and plotted on the y axis. The calculation was carried out using mapminmax function implemented in Matlab.