Fig. 6

Biotype annotation counts. Distribution of coding and non-coding gene biotypes among the analyzed features. Protein-coding genes represent the majority (17,148 genes), followed by processed pseudogenes (2,869 genes), long non-coding RNAs (lncRNAs, 2,040 genes), and other non-coding RNAs (328 genes). In total, 22,385 genes were annotated by biotype out of 35,144 total genes present in the dataset, highlighting the inclusion of both coding and non-coding transcriptomic features.