Abstract
Methods to characterize the functional effects of genetic variants of uncertain significance (VUSs) have been limited by incomplete coverage of the mutational space. In clinical oncology, drug resistance arising from VUSs can prevent optimal treatment. Here we introduce PEER-seq, a high-throughput method based on prime editing that can evaluate the functional effects of single-nucleotide variants (SNVs). PEER-seq introduces both intended SNVs and synonymous marker mutations using prime editing and deep sequences the endogenous target regions to identify the introduced SNVs. We generate and functionally evaluate 2,476 SNVs in the epidermal growth factor receptor gene (EGFR), including 99% of all possible variants in the canonical tyrosine kinase ___domain. We determined resistance profiles of 95% of all possible EGFR protein variants encoded in the whole tyrosine kinase ___domain against the common tyrosine kinase inhibitors afatinib, osimertinib and osimertinib in the presence of the co-occurring substitution T790M, in PC-9 cells. Our study has the potential to substantially improve the precision of therapeutic choices in clinical settings.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
27,99 € / 30 days
cancel any time
Subscribe to this journal
Receive 12 print issues and online access
209,00 € per year
only 17,42 € per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout






Similar content being viewed by others
Data availability
The deep sequencing data from this study were deposited to the National Center for Biotechnology Information’s Sequence Read Archive under accession number PRJNA1018283. We provided the datasets used in this study as Supplementary Tables 2–5.
Code availability
Screening data were analyzed with in-house custom Python scripts and MAGeCK (version 0.5.9.3). Custom Python scripts (version 3.8.16) were used to generate input files on the basis of grouped UMIs for MAGeCK; they are available from GitHub (https://github.com/oreolic/SGE_EGFR).
References
Yang, J. C. et al. Afatinib for patients with lung adenocarcinoma and epidermal growth factor receptor mutations (LUX-Lung 2): a phase 2 trial. Lancet Oncol. 13, 539–548 (2012).
Soria, J. C. et al. Osimertinib in untreated EGFR-mutated advanced non-small-cell lung cancer. N. Engl. J. Med. 378, 113–125 (2018).
Yun, C.-H. et al. Structures of lung cancer-derived EGFR mutants and inhibitor complexes: mechanism of activation and insights into differential inhibitor sensitivity. Cancer Cell 11, 217–227 (2007).
Lynch, T. J. et al. Activating mutations in the epidermal growth factor receptor underlying responsiveness of non-small-cell lung cancer to gefitinib. N. Engl. J. Med. 350, 2129–2139 (2004).
Paez, J. G. et al. EGFRmutations in lung cancer: correlation with clinical response to gefitinib therapy. Science 304, 1497–1500 (2004).
Sharma, S. V., Bell, D. W., Settleman, J. & Haber, D. A. Epidermal growth factor receptor mutations in lung cancer. Nat. Rev. Cancer 7, 169–181 (2007).
Robichaux, J. P. et al. Structure-based classification predicts drug response in EGFR-mutant NSCLC. Nature 597, 732–737 (2021).
Campo, M. et al. Acquired resistance to first-line afatinib and the challenges of prearranged progression biopsies. J. Thorac. Oncol. 11, 2022–2026 (2016).
Thress, K. S. et al. Acquired EGFR C797S mutation mediates resistance to AZD9291 in non-small cell lung cancer harboring EGFR T790M. Nat. Med. 21, 560–562 (2015).
Passaro, A., Jänne, P. A., Mok, T. & Peters, S. Overcoming therapy resistance in EGFR-mutant lung cancer. Nat. Cancer 2, 377–391 (2021).
Russo, A. et al. Heterogeneous responses to epidermal growth factor receptor (EGFR) tyrosine kinase inhibitors (TKIs) in patients with uncommon EGFR mutations: new insights and future perspectives in this complex clinical scenario. Int. J. Mol. Sci. 20, 1431 (2019).
Passaro, A. et al. Recent advances on the role of EGFR tyrosine kinase inhibitors in the management of NSCLC with uncommon, non exon 20 insertions, EGFR mutations. J. Thorac. Oncol. 16, 764–773 (2021).
Yang, J. C. et al. Afatinib for the treatment of NSCLC harboring uncommon EGFR mutations: a database of 693 cases. J. Thorac. Oncol. 15, 803–815 (2020).
Janning, M. et al. Treatment outcome of atypical EGFR mutations in the German National Network Genomic Medicine Lung Cancer (nNGM). Ann. Oncol. 33, 602–615 (2022).
Pretelli, G., Spagnolo, C. C., Ciappina, G., Santarpia, M. & Pasello, G. Overview on therapeutic options in uncommon EGFR mutant non-small cell lung cancer (NSCLC): new lights for an unmet medical need. Int. J. Mol. Sci. 24, 8878 (2023).
Kohsaka, S. et al. A method of high-throughput functional evaluation of EGFR gene variants of unknown significance in cancer. Sci. Transl. Med. 9, 416 (2017).
Chakroborty, D. et al. An unbiased in vitro screen for activating epidermal growth factor receptor mutations. J. Biol. Chem. 294, 9377–9389 (2019).
An, L. et al. Defining the sensitivity landscape of EGFR variants to tyrosine kinase inhibitors. Transl. Res. 255, 14–25 (2023).
Gaudelli, N. M. et al. Programmable base editing of A•T to G•C in genomic DNA without DNA cleavage. Nature 551, 464–471 (2017).
Komor, A. C., Kim, Y. B., Packer, M. S., Zuris, J. A. & Liu, D. R. Programmable editing of a target base in genomic DNA without double-stranded DNA cleavage. Nature 533, 420–424 (2016).
Nishida, K. et al. Targeted nucleotide editing using hybrid prokaryotic and vertebrate adaptive immune systems. Science 353, aaf8729 (2016).
Anzalone, A. V. et al. Search-and-replace genome editing without double-strand breaks or donor DNA. Nature 576, 149–157 (2019).
Findlay, G. M. et al. Accurate classification of BRCA1 variants with saturation genome editing. Nature 562, 217–222 (2018).
Meitlis, I. et al. Multiplexed functional assessment of genetic variants in CARD11. Am. J. Hum. Genet. 107, 1029–1043 (2020).
Buckley, M. et al. Saturation genome editing maps the functional spectrum of pathogenic VHL alleles. Nat. Genet. 56, 1446–1455 (2024).
Hanna, R. E. et al. Massively parallel assessment of human variants with base editor screens. Cell 184, 1064–1080 (2021).
Cuella-Martin, R. et al. Functional interrogation of DNA damage response variants with base editing screens. Cell 184, 1081–1097 (2021).
Kim, Y. et al. High-throughput functional evaluation of human cancer-associated mutations using base editors. Nat. Biotechnol. 40, 874–884 (2022).
Sanchez-Rivera, F. J. et al. Base editing sensor libraries for high-throughput engineering and functional analysis of cancer-associated single nucleotide variants. Nat. Biotechnol. 40, 862–873 (2022).
Perner, F. et al. MEN1 mutations mediate clinical resistance to menin inhibition. Nature 615, 913–919 (2023).
Erwood, S. et al. Saturation variant interpretation using CRISPR prime editing. Nat. Biotechnol. 40, 885–895 (2022).
Findlay, G. M., Boyle, E. A., Hause, R. J., Klein, J. C. & Shendure, J. Saturation editing of genomic regions by multiplex homology-directed repair. Nature 513, 120–123 (2014).
Radford, E. J. et al. Saturation genome editing of DDX3X clarifies pathogenicity of germline and somatic variation. Nat. Commun. 14, 7702 (2023).
Huang, H. et al. Saturation genome editing-based functional evaluation and clinical classification of BRCA2 single nucleotide variants. Preprint at bioRxiv https://doi.org/10.1101/2023.12.14.571597 (2023).
Chen, P. J. et al. Enhanced prime editing systems by manipulating cellular determinants of editing outcomes. Cell 184, 5635–5652 (2021).
Li, X. et al. Highly efficient prime editing by introducing same-sense mutations in pegRNA or stabilizing its structure. Nat. Commun. 13, 1669 (2022).
Yu, G. et al. Prediction of efficiencies for diverse prime editing systems in multiple cell types. Cell 186, 2256–2272 (2023).
Salk, J. J., Schmitt, M. W. & Loeb, L. A. Enhancing the accuracy of next-generation sequencing for detecting rare and subclonal mutations. Nat. Rev. Genet. 19, 269–285 (2018).
Stoler, N. & Nekrutenko, A. Sequencing error profiles of Illumina sequencing instruments. NAR Genom. Bioinform. 3, lqab019 (2021).
Broad Institute of Harvard and MIT. DepMap: the cancer dependency map project at Broad Institute. DepMap https://depmap.org/portal/ (2020).
Hart, T. et al. High-Resolution CRISPR screens reveal fitness genes and genotype-specific cancer liabilities. Cell 163, 1515–1526 (2015).
Doench, J. G. et al. Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR–Cas9. Nat. Biotechnol. 34, 184–191 (2016).
Doench, J. G. Am I ready for CRISPR? A user’s guide to genetic screens. Nat. Rev. Genet. 19, 67–80 (2018).
Ren, X. et al. High-throughput PRIME-editing screens identify functional DNA variants in the human genome. Mol. Cell 83, 4633–4645 (2023).
Gould, S. I. et al. High-throughput evaluation of genetic variants with prime editing sensor libraries. Nat. Biotechnol. https://doi.org/10.1038/s41587-024-02172-9 (2024).
Chardon, F. M. et al. A multiplex, prime editing framework for identifying drug resistance variants at scale. Preprint at bioRxiv https://doi.org/10.1101/2023.07.27.550902 (2023).
Li, W. et al. MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens. Genome Biol. 15, 554 (2014).
Yu, H. A. et al. Analysis of tumor specimens at the time of acquired resistance to EGFR-TKI therapy in 155 patients with EGFR-mutant lung cancers. Clin. Cancer Res. 19, 2240–2247 (2013).
Mok, T. S. et al. Osimertinib or platinum–pemetrexed in EGFR T790M-positive lung cancer. N. Engl. J. Med. 376, 629–640 (2016).
Tan, C.-S. et al. Third generation EGFR TKIs: current data and future directions. Mol. Cancer 17, 29 (2018).
Kobayashi, Y. et al. Characterization of EGFR T790M, L792F, and C797S mutations as mechanisms of acquired resistance to afatinib in lung cancer. Mol. Cancer Ther. 16, 357–364 (2017).
Liu, Y. et al. Acquired EGFR L718V mutation mediates resistance to osimertinib in non-small cell lung cancer but retains sensitivity to afatinib. Lung Cancer 118, 1–5 (2018).
Yang, Z. et al. Investigating novel resistance mechanisms to third-generation EGFR tyrosine kinase inhibitor osimertinib in non-small cell lung cancer patients. Clin. Cancer Res. 24, 3097–3107 (2018).
Li, M. et al. L718Q/V mutation in exon 18 of EGFR mediates resistance to osimertinib: clinical features and treatment. Discov. Oncol. 13, 72 (2022).
Sueoka-Aragane, N. et al. The role of comprehensive analysis with circulating tumor DNA in advanced non-small cell lung cancer patients considered for osimertinib treatment. Cancer Med. 10, 3873–3885 (2021).
Carlo, D. E. et al. Acquired EGFR C797G mutation detected by liquid biopsy as resistance mechanism after treatment with osimertinib: a case report. In Vivo 35, 2941–2945 (2021).
Nie, K. et al. Mutational profiling of non-small-cell lung cancer resistant to osimertinib using next-generation sequencing in chinese patients. BioMed Res. Int. 2018, 9010353 (2018).
Avizienyte, E., Ward, R. A. & Garner, A. P. Comparison of the EGFR resistance mutation profiles generated by EGFR-targeted tyrosine kinase inhibitors and the impact of drug combinations. Biochem. J. 415, 197–206 (2008).
Bean, J. et al. Acquired resistance to epidermal growth factor receptor kinase inhibitors associated with a novel T854A mutation in a patient with EGFR-mutant lung adenocarcinoma. Clin. Cancer Res. 14, 7519–7525 (2008).
Zhang, L. et al. Molecular characteristics of the uncommon EGFR exon 21 T854A mutation and response to osimertinib in patients with non-small cell lung cancer. Clin. Lung Cancer 23, 311–319 (2022).
Xing, P. et al. Co-mutational assessment of circulating tumour DNA (ctDNA) during osimertinib treatment for T790M mutant lung cancer. J. Cell. Mol. Med. 23, 6812–6821 (2019).
Brown, B. P. et al. Allele-specific activation, enzyme kinetics, and inhibitor sensitivities of EGFR exon 19 deletion mutations in lung cancer. Proc. Natl Acad. Sci. USA 119, e2206588119 (2022).
Malapelle, U. et al. Profile of the Roche cobas® EGFR mutation test v2 for non-small cell lung cancer. Expert Rev. Mol. Diagn. 17, 209–215 (2017).
Yu, Z. et al. Resistance to an irreversible epidermal growth factor receptor (EGFR) inhibitor in EGFR-mutant lung cancer reveals novel treatment strategies. Cancer Res. 67, 10417–10427 (2007).
Zhang, X., Gureasko, J., Shen, K., Cole, P. A. & Kuriyan, J. An allosteric mechanism for activation of the kinase ___domain of epidermal growth factor receptor. Cell 125, 1137–1149 (2006).
Hu, Y. et al. Discrimination of germline EGFR T790M mutations in plasma cell-free DNA allows study of prevalence across 31,414 cancer patients. Clin. Cancer Res. 23, 7351–7359 (2017).
Leonetti, A. et al. Resistance mechanisms to osimertinib in EGFR-mutated non-small cell lung cancer. Br. J. Cancer 121, 725–737 (2019).
Brown, B. P. et al. On-target resistance to the mutant-selective EGFR inhibitor osimertinib can develop in an allele-specific manner dependent on the original EGFR-activating mutation. Clin. Cancer Res. 25, 3341–3351 (2019).
Wei, Y. et al. Afatinib as a potential therapeutic option for patients with NSCLC with EGFR G724S. JTO Clin. Res. Rep. 2, 100193 (2021).
Zhang, Q. et al. EGFR L792H and G796R: two novel mutations mediating resistance to the third-generation EGFR tyrosine kinase inhibitor osimertinib. J. Thorac. Oncol. 13, 1415–1421 (2018).
Klempner, S. J., Mehta, P., Schrock, A. B., Ali, S. M. & Ou, S. I. cis-oriented solvent-front EGFR G796S mutation in tissue and ctDNA in a patient progressing on osimertinib: a case report and review of the literature. Lung Cancer 8, 241–247 (2017).
Zheng, D. et al. EGFR G796D mutation mediates resistance to osimertinib. Oncotarget 8, 49671–49679 (2017).
Liu, Y., Yang, Q. & Zhao, F. Synonymous but not silent: the codon usage code for gene expression and protein folding. Annu. Rev. Biochem. 90, 375–401 (2021).
John, T. et al. Uncommon EGFR mutations in non-small-cell lung cancer: a systematic literature review of prevalence and clinical outcomes. Cancer Epidemiol. 76, 102080 (2022).
Mathis, N. et al. Machine learning prediction of prime editing efficiency across diverse chromatin contexts. Nat. Biotechnol. https://doi.org/10.1038/s41587-024-02268-2 (2024).
Nelson, J. W. et al. Engineered pegRNAs improve prime editing efficiency. Nat. Biotechnol. 40, 402–410 (2022).
Mantaci, S., Restivo, A. & Sciortino, M. Distance measures for biological sequences: some recent approaches. Int. J. Approx. Reason. 47, 109–124 (2008).
Kim, H. K. et al. In vivo high-throughput profiling of CRISPR–Cpf1 activity. Nat. Methods 14, 153–159 (2017).
Dang, Y. et al. Optimizing sgRNA structure to improve CRISPR–Cas9 knockout efficiency. Genome Biol. 16, 280 (2015).
Sanjana, N. E., Shalem, O. & Zhang, F. Improved vectors and genome-wide libraries for CRISPR screening. Nat. Methods 11, 783–784 (2014).
Miller, S. M. et al. Continuous evolution of SpCas9 variants compatible with non-G PAMs. Nat. Biotechnol. 38, 471–481 (2020).
Briggs, A. W. et al. Iterative capped assembly: rapid and scalable synthesis of repeat-module DNA such as TAL effectors from individual monomers. Nucleic Acids Res. 40, e117 (2012).
Kim, S., Bae, T., Hwang, J. & Kim, J. S. Rescue of high-specificity Cas9 variants using sgRNAs with matched 5′ nucleotides. Genome Biol. 18, 218 (2017).
Park, J., Lim, K., Kim, J. S. & Bae, S. Cas-analyzer: an online tool for assessing genome editing results using NGS data. Bioinformatics 33, 286–288 (2017).
Koblan, L. W. et al. Improving cytidine and adenine base editors by expression optimization and ancestral reconstruction. Nat. Biotechnol. 36, 843–846 (2018).
Heckl, D. & Charpentier, E. Toward whole-transcriptome editing with CRISPR–Cas9. Mol. Cell 58, 560–562 (2015).
Shalem, O. et al. Genome-scale CRISPR–Cas9 knockout screening in human cells. Science 343, 84–87 (2014).
DeWeirdt, P. C. et al. Optimization of AsCas12a for combinatorial genetic screens in human cells. Nat. Biotechnol. 39, 94–104 (2021).
Chen, W. et al. Massively parallel profiling and predictive modeling of the outcomes of CRISPR/Cas9-mediated double-strand break repair. Nucleic Acids Res. 47, 7989–8003 (2019).
Cerami, E. et al. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2, 401–404 (2012).
Sondka, Z. et al. COSMIC: a curated database of somatic variants and clinical data for cancer. Nucleic Acids Res. 52, D1210–D1217 (2024).
Ettinger, D. S. et al. Non-small cell lung cancer, version 3.2022, NCCN clinical practice guidelines in oncology. J. Natl Compr. Canc. Netw. 20, 497–530 (2022).
Ten Hacken, E. et al. High throughput single-cell detection of multiplex CRISPR-edited gene modifications. Genome Biol. 21, 266 (2020).
Cross, D. A. et al. AZD9291, an irreversible EGFR TKI, overcomes T790M-mediated resistance to EGFR inhibitors in lung cancer. Cancer Discov. 4, 1046–1061 (2014).
Zhang, K. R. et al. Targeting AKR1B1 inhibits glutathione de novo synthesis to overcome acquired resistance to EGFR-targeted therapy in lung cancer. Sci. Transl. Med. 13, eabg6428 (2021).
Acknowledgements
We thank J. Hong, J. Lee, B. Park, Y. Kim, G. Baek and S. Park for assisting with the experiments. Schematics in Figs. 2a, 3b and 6b, and Supplementary Figs. 10a and 12a created with BioRender.com. This work was supported, in part, by the National Research Foundation (NRF) of Korea grant funded by the Korean Ministry of Science and ICT (MSIT) (2022R1A3B1078084 and 2018R1A5A2025079 (to H.H.K.)), the Bio and Medical Technology Development Program of the NRF funded by the Korean MSIT (2022M3A9E4017127, 2022M3A9F3017506 and RS-2023-00260968 (to H.H.K.)), the Yonsei Signature Research Cluster Program of 2024-22-0165 (to H.H.K.), the Brain Korea 21 FOUR Project for Medical Science (Yonsei University College of Medicine), the SNUH Kun-hee Lee Child Cancer and Rare Disease Project, Republic of Korea (22B-000-0101 (to H.H.K.)), the Yonsei Fellow Program, funded by Lee Youn Jae (to H.H.K.), the Genome Editing Research Program funded by the Korean MSIT (RS-2023-00263285 (to Y.K.)), the Basic Science Research Program through the NRF of Korea funded by the Ministry of Education (2022R1I1A1A01066096 (to Y.K.)), a faculty research grant of Yonsei University College of Medicine (6-2022-0078 (to Y.K.)), the Catholic Medical Center Research Foundation in the program year of 2023 (5-2023-B0001-00047 (to Y.K.)), the Basic Medical Science Facilitation Program through the Catholic Medical Center of The Catholic University of Korea funded by the Catholic Education Foundation (to Y.K.) and a grant of the MD–PhD/Medical Scientist Training Program (to H.C.O.) through the Korea Health Industry Development Institute, funded by the Korean Ministry of Health and Welfare.
Author information
Authors and Affiliations
Contributions
Y.K., H.C.O., S.L. and H.H.K. conceptualized and designed the research. Y.K., H.C.O. and S.L. performed the experiments and data analysis. Y.K., H.C.O., S.L. and H.H.K. wrote the paper.
Corresponding author
Ethics declarations
Competing interests
Yonsei University has filed a patent application based on this work, in which Y.K., H.C.O., S.L. and H.H.K. are listed as inventors. H.H.K is the founder of LiquidCRISPR.
Peer review
Peer review information
Nature Biotechnology thanks the anonymous reviewers for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Sequencing errors can hinder the accurate identification of SNVs induced by prime editing.
a,b. Maps of lentiviral vectors used for the expression of prime editor (PEmax) (a) and pegRNAs (b). bpNLS, bipartite nuclear localization signal; MMLV-RT, human codon-optimized Moloney murine leukemia virus reverse transcriptase; ITR, inverted termainal repeat. (b) Locations of PCR primers used to deep-sequence a 229-bp region containing the RT template, PBS (primer binding site), pegRNA barcode, and unique molecular identifier (UMI) are shown. c, Proportion of sequencing reads containing substitution(s) in unedited PC-9 cells expressing NRCH-PEmax (left, unedited control) and in those ten days after transduction with the NRCH-exon20 library (right) for editing exon 20 of EGFR. d, Heatmap showing odds ratios and/or P-values of 558 (=186 ×3) SNVs generated by prime editing in exon 20 of EGFR ten days after the transuction of the NRCH-exon20 library. SNVs with P-values greater than 0.05 by the two-sided Fisher’s exact test are indicated in red; in these cases, odds ratios are not shown. SNVs with odds ratios lower than 3 are shown with a white background. The numbers at the bottom of each heatmap represent the ___location in the EGFR coding sequence. At each position, the nucleotide in the reference sequence is shown. e,f, Distribution of odds ratios (e) and P-values (f) of 558 SNVs in cells transduced with the NRCH-exon20 library. The dashed horizontal lines indicate the position at which the odds ratio = 3 (e) and the P-value = 0.05 (f). The P-value was calculated using a two-sided Fisher’s exact test. g, Distribution of observed SNV frequencies in PC-9 cells expresing PEmax ten days after transduction with NRCH-exon20. The number of SNVs n = 524 (Nonsignificant), n = 34 (Significant). Boxes represent the 25th, 50th(median), and 75th percentiles, and whiskers show the 10th and 90th percentiles.
Extended Data Fig. 2 Optimization of PEER-seq and comparison with pegRNA-based analysis.
a, Correction of positional biases. To correct positional biases in LFCs, we employed LOWESS regression by using synonymous SNVs, which were assumed to have no functional effect. The LOWESS regression curves are shown as orange lines. The overall depletion of nonsense SNVs was more distinct after the adjustment. b, ROC-AUC analysis to determine the effect of positional bias correction for sets of nonsense (the number of SNVs n = 27) vs. synonymous SNVs (n = 117) in exon 2 of RPL15. Pre, ROC-AUC before adjustment; Post, ROC-AUC after adjustment. c, Correlation between PEER-seq LFC values from two biological replicates. The Pearson correlation coefficient (r) is shown. The number of SNVs n = 511. d,e, Heatmaps showing adjusted LFCs of 516 (=172 × 3) SNVs (d) and 336 protein variants (e) generated by prime editing in exon 2 of RPL15. SNVs (d) and protein variants (e) with P-values of two-sided Fisher’s exact test greater than 0.05 or odds ratios lower than 3 were excluded from the analysis and are shown with a white background. SNVs (d) and protein variants (e) for which no pegRNAs were designed are shown as gray boxes. The numbers at the bottom of each heatmap represent the ___location in the RPL15 coding sequence (d) and in the RPL15 amino acid sequence (e). At each position, the nucleotide (d) and amino acid (e; WT, wild-type; top) in the reference sequences are shown. f, Correlation between adjusted LFC values of SNVs determined by pegRNA abundance-based analysis from two biological replicates. The Pearson correlation coefficient (r) is shown. The number of SNVs n = 511. g, Kernel density estimation plots of adjusted LFCs of SNVs in RPL15 determined by pegRNA abudance-based analysis as a function of the SNV category. For each category, the number and percentage of SNVs with adjusted LFC values lower than a cutoff value (the gray dashed vertical line), representing the 5th percentile of the adjusted LFC values of synonynous mutations, are shown. h, Correlation between adjusted LFC values of SNVs calculated from PEER-seq evaluations and those from pegRNA abundance-based analysis. The Pearson correlation coefficient (r) is shown. The number of SNVs n = 511.
Extended Data Fig. 3 PEER-seq evaluation of SNVs in BRCA1.
a, Flow cytometry gating strategy used to isolate haploid cells. b, Correlation between PEER-seq LFC values of SNVs in BRCA1 from two biological replicates. The Pearson correlation coefficient (r) is shown. The number of SNVs n = 239. c, Correlation between HDR function scores obtained by Findlay et al. (2018) and PEER-seq LFC values for SNVs in exon 19 of BRCA1. The Pearson correlation coefficient (r) is shown. The number of SNVs n = 239. d, Kernel density estimation plots of adjusted LFCs of SNVs in the region encoding exon 19 of BRCA1 as a function of the category of SNV. For each category, the number and percentage of SNVs with adjusted LFC values lower than a cutoff value (the gray dashed vertical line), representing the 5th percentile of adjusted LFC values of synoynous mutations, are shown. ‘Canonical splice’ denotes the two intronic positions immediately flanking exon 19 of BRCA1 and ‘splice region’ denotes three other intronic positions. e,f, ROC curves for adjusted LFCs of SNVs determined by PEER-seq (blue) and HDR function scores (red) for sets of SNVs with pathogenic/likely pathogenic classifications (the number of SNVs n = 10) vs. SNVs with benign/likely benign classifications (n = 6) from ClinVar in exon 19 of BRCA1 (e) or sets of nonsense and canonical splice sites (n = 17) vs. synonymous SNVs (n = 44) (f). Area under curve values are shown.
Extended Data Fig. 4 Identified SNVs.
Heatmap showing odds ratios and/or P-values of 2,610 (=870 × 3) SNVs generated by prime editing in exons 18-24 of EGFR ten days after the transduction of the Syn-exon18, Syn-exon19, …, Syn-exon24 libraries. SNVs with P-values greater than 0.05 by the two-sided Fisher’s exact test are indicated in red; in these cases, odds ratios are not shown. SNVs with odds ratios lower than 3 are shown with a white background. The numbers at the bottom of each heatmap represent the ___location in the EGFR coding sequence. At each position, the nucleotide in the reference sequence is shown. Edited reads were identified based on the the presence of both the intended edit and an additional synonymous edit.
Extended Data Fig. 5 Evaluation of resistance profiles of EGFR protein variants.
a, Correlation between resistance scores following treatment with afatinib (left), osimertinib in the absence of T790M (middle), and osimertinib in the presence of T790M (right) in pairs of SNVs encoding the same protein variants. The classification of each protein variant is indicated by the dot color. Pearson correlation coefficients (r) are shown. The number of SNV pairs n = 218 (left), 218 (middle), and 210 (right). b, Correlation between resistance scores of protein variants following treatment with afatinib (left), osimertinib in the absence of T790M (middle), and osimertinib in the presence of T790M (right) in two biological replicates. The classification of each protein variant is indicated by the dot color. Pearson correlation coefficients are shown. The number of protein variants n = 1,726 (left), 1,726 (middle), and 1,671 (right). c, The number of sensitive and resistant protein variants functionally classified in the current study. The percentages of protein variants lacking previously published information about their effect on drug resistance, among all sensitive or resistant variants, are indicated on the blue bars.
Extended Data Fig. 6 Heatmap showing afatinib resistance scores of 1,817 protein variants.
These variants were generated by prime editing in exons 18-24 of EGFR in PC-9 cells. Boxes outlined in yellow and gray indicate protein variants causing resistant and intermediate phenotypes, respectively. The numbers at the bottom of each heatmap represent the ___location in the EGFR amino acid sequence. At each position, the amino acid in the reference sequence is shown at the top. Thirty protein variants with P-values of two-sided Fisher’s exact test greater than 0.05 or odds ratios lower than 3 were excluded from the analysis and are shown with a white background. Protein variants for which no pegRNAs were designed are shown as gray boxes.
Extended Data Fig. 7 Heatmap showing osimertinib resistance scores of 1,817 protein variants.
These variants were generated by prime editing in exons 18-24 of EGFR in PC-9 cells. Boxes outlined in yellow and gray indicate protein variants causing resistant and intermediate phenotypes, respectively. The numbers at the bottom of each heatmap represent the ___location in the EGFR amino acid sequence. At each position, the amino acid in the reference sequence is shown at the top. Thirty protein variants with P-values of two-sided Fisher’s exact test greater than 0.05 or odds ratios lower than 3 were excluded from the analysis and are shown with a white background. Protein variants for which no pegRNAs were designed are shown as gray boxes.
Extended Data Fig. 8 Heatmap showing osimertinib resistance scores of 1,817 protein variants in the presence of a co-occuring T790M mutation.
These variants were generated by prime editing in exons 18-24 of EGFR in PC-9 cells containing the T790M mutation. Boxes outlined in yellow and gray indicate protein variants causing resistant and intermediate phenotypes, respectively. The numbers at the bottom of each heatmap represent the ___location in the EGFR amino acid sequence. At each position, the amino acid in the reference sequence is shown at the top. Ninety-four protein variants with P-values of two-sided Fisher’s exact test greater than 0.05 or odds ratios lower than 3 were excluded from the analysis and are shown with a white background. Protein variants for which no pegRNAs were designed are shown as gray boxes.
Extended Data Fig. 9 Zygosity of prime edits.
a, Distribution of reads for each single cell-derived clone containing the indicated SNVs. ‘SNVs’ (shown in sky blue) indicates reads that contain the indicated SNVs without any other mutations. ‘Wild-type’ (green) indicates reads without the intended SNVs or any other mutations. ‘Other’ (yellow) indicates reads that fall into neither of these categories. The number of analyzed single cell-derived clones n = 51 for G930R, 50 for K754Q, and 38 for C797S in EGFR, 12 for K13* and 17 for A48E in RPL15. The stacked bars shown on the left of each graph represent the reads for the populations from which the clones are derived. b, Zygosity of the prime editing-induced SNVs indicated on the x axis. Given that the PC-9 cells that we used contain six and two copies of EGFR and RPL15, respectively, we classified the intended prime edits as partial gene copy editing if the percentage of reads containing SNVs in a clone ranged from 8.3% (=100/6 × 0.5) to 91.7% (100–8.3%) for EGFR and from 20% to 80% for RPL15.
Extended Data Fig. 10 PEER-seq experiments with higher concentrations of TKIs.
a, Correlation between PEER-seq resistance scores for mutations in exon 20 of EGFR following treatment with 8 nM osimertinib or a higher dose of osimertinib. The classification of each SNV is indicated by the dot color. The Pearson correlation coefficients (r) are shown. b, Comparison between functional classification results from PEER-seq experiments following treatment with 8 nM osimertinib and those with higher doses of osimertinib. The intensity of the color associated with entries in a given column was determined by the relative number of variants (that is, the percentages shown within the parentheses) within each category in that column. The variants are listed when there are clear discrepancies between evaluation results using different concentrations of osimertinib. c, Relative cell counts, compared to the cell count at seeding, are shown for the time point after 5 days of treatment with the specified doses of osimertinib. Error bars represent the mean and standard errors. The number of replicates n = 3.
Supplementary information
Supplementary Information
Supplementary Notes 1–4, Figs. 1–12 and Tables 2, 6 and 7.
Supplementary Tables
Supplementary Tables 1, 3, 4, 5, 8, 9 and 10.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kim, Y., Oh, HC., Lee, S. et al. Saturation profiling of drug-resistant genetic variants using prime editing. Nat Biotechnol (2024). https://doi.org/10.1038/s41587-024-02465-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41587-024-02465-z
This article is cited by
-
Perturbomics: CRISPR–Cas screening-based functional genomics approach for drug target discovery
Experimental & Molecular Medicine (2025)
-
Prime editor-based high-throughput screening reveals functional synonymous mutations in human cells
Nature Biotechnology (2025)