Extended Data Fig. 1: Quality and reproducibility of RNA-seq and Ribo-seq data. | Nature

Extended Data Fig. 1: Quality and reproducibility of RNA-seq and Ribo-seq data.

From: Pervasive downstream RNA hairpins dynamically dictate start-codon selection

Extended Data Fig. 1

a, BioAnalyzer profiles showed high quality of the Ribo-seq libraries. Apart from the internal standard sized at 35 bp and 10,380 bp, a single peak at around 150 bp was present in all the libraries for mock and elf18 treatment in all three biological replicates (Reps 1–3). b, Length distribution of reads from the Ribo-seq libraries. c,d, Correlations among the three replicates of RNA-seq (c) and Ribo-seq (d) data from mock- and elf18-treated samples. Data are shown as correlations of log2(RPKM+1) for all the genes. r, Pearson correlation coefficient. e, Metagene analysis on the average read counts surrounding start and stop codons for reads at different lengths (top). P-site offsets were detected at the length of 13–15 nt surrounding start codons and at the length of 17–19 nt surrounding stop codons (bottom). 5′ LS, 5′ leader sequence. f, Power spectral density of normalized Ribo-seq read counts in the 300-nt window downstream of the start codon shows 3-nt periodicity. The units are (normalized read counts)^2 per nucleotide period. g, Total RNA-seq and Ribo-seq read distribution in 5′ LS, CDS, and 3′ UTR of the 13,051 expressed transcripts (n = 13,051). Boxes, IQR. Centre lines, median. Whiskers, values within 1.5 × IQR of the top and bottom quartiles. Grey circles represent RPKM values for individual outlier transcripts. h, Metagene analysis across normalized transcript for Ribo-seq reads in all the mock and elf18-induced samples with the read length ranging from 24 nt to 35 nt. 5′ LS, 5′ leader sequence. 3′ UTR, 3′ untranslated region.

Back to article page