Extended Data Fig. 2: Variant locations relative to cis-regulatory features. | Nature Genetics

Extended Data Fig. 2: Variant locations relative to cis-regulatory features.

From: Prioritization of autoimmune disease-associated genetic variants that perturb regulatory element activity in T cells

Extended Data Fig. 2

a) Location relative to TSSs of all MPRA tested variants, active elements (pCRE), and emVars. b) Enrichment of variants within pCREs (light blue) and emVars (dark blue) within chromHMM-defined genomic regions in human T cells. * for P < 0.05; *** P < 1.4 ×10−3 (Bonferroni-corrected for 36 independent tests). c) Functional enrichment of variants within pCREs and emVars. *** for P < 6.3 ×10−3 (nominal p-value threshold of 0.05 Bonferroni-corrected for 8 independent tests). d) Proportion of inactive element and pCRE variants and emVars that have allelic bias in ATAC-seq. e) Scatter plot comparing MPRA log2 allelic bias (y axis) with allelic bias in ATAC-seq from hematopoietic cells (x axis)5. Red dots are emVars (n = 5) and gray dots are pCRE variants (n = 45). f) Proportion of MPRA inactive and pCRE variants, and emVars that are chromatin accessibility QTLs (caQTLs) from T cells65. g) Scatter plot comparing caQTL effect size (beta; x axis) and MPRA log2 allelic bias (y axis). Red dots are emVars (n = 6) and gray dots are pCREs (n = 22). h) Scatter plot comparing deltaSVM score (x axis) with MPRA log2 allelic bias (y axis) (n = 278). i) Proportion of MPRA inactive and pCRE variants and emVars that overlap TF motifs. j) Scatter plot comparing allele-specific TF binding scores (y axis) and MPRA allelic bias (x axis) for emVars predicted to perturb TF binding (n = 11). Enrichment of emVars for TF ChIP-seq (-log10P on y axis). Calculations for (b and c) are risk ratios (see Methods) with Fisher’s exact test P values and Bonferroni correction (see Supplementary Tables 5 and 6 for exact P values). (d, f, and i) P values calculated using two-sided two proportions z test with no multiple comparisons adjustment. (e, g, h and j) P values are from linear regression F statistic. (k) P values are from a two-sided binomial test.

Back to article page