Fig. 2

Data preprocessing statistics. Adapter: reads containing adapter sequences; Low Quality: low-quality reads; PolyA: percentage of reads containing polyA; N: single-end reads in which more than 10% of the bases are N; Duplication: total length of the removed duplicate read sequences.