Abstract
Resolving the molecular basis of a Mendelian condition remains challenging owing to the diverse mechanisms by which genetic variants cause disease. To address this, we developed a synchronized long-read genome, methylome, epigenome and transcriptome sequencing approach, which enables accurate single-nucleotide, insertion–deletion and structural variant calling and diploid de novo genome assembly. This permits the simultaneous elucidation of haplotype-resolved CpG methylation, chromatin accessibility and full-length transcript information in a single long-read sequencing run. Application of this approach to an Undiagnosed Diseases Network participant with a chromosome X;13-balanced translocation of uncertain significance revealed that this translocation disrupted the functioning of four separate genes (NBEA, PDK3, MAB21L1 and RB1) previously associated with single-gene Mendelian conditions. Notably, the function of each gene was disrupted via a distinct mechanism that required integration of the four ‘omes’ to resolve. These included fusion transcript formation, enhancer adoption, transcriptional readthrough silencing and inappropriate X-chromosome inactivation of autosomal genes. Overall, this highlights the utility of synchronized long-read multi-omic profiling for mechanistically resolving complex phenotypes.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
27,99 € / 30 days
cancel any time
Subscribe to this journal
Receive 12 print issues and online access
209,00 € per year
only 17,42 € per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout





Similar content being viewed by others
Data availability
All raw and processed sequencing data generated for GM12878, GM23485, GM20129 and HG02630 in this study have been submitted to the NCBI BioProject database (https://www.ncbi.nlm.nih.gov/bioproject/) under accession PRJNA1124997. Restrictions apply to the availability of some data generated or analyzed during this study to preserve subject confidentiality. Genomic data for our case report are accessible to the scientific community through the UDN (dbGaP). Individuals interested in accessing UDN data through dbGaP should submit a data access request. Detailed instructions for this process can be found on the NIH Scientific Data Sharing website (https://sharing.nih.gov/accessing-data/accessing-genomic-data/how-to-request-and-access-datasets-from-dbgap). Cell lines obtained from the National Institute of General Medical Sciences Human Genetic Cell Repository at the Coriell Institute for Medical Research include GM12878, GM23485, GM20129 and HG02630. The UDN318336 fibroblast line was obtained directly from UDN participant UDN318336. Hia5 enzyme and plasmid are available upon request. ONT data for the CpG comparison are available from Epi2me (https://labs.epi2me.io/giab-2023.05/, super accuracy analysis level 60× BAM files).
Processed FIRE results for GM12878, GM23485, GM20129, HG02630, UDN318336 and retinal organoid derived from UDN318336 are publicly available through https://doi.org/10.5281/zenodo.14511246, as well as through GitHub (https://github.com/StergachisLab/Fiber-seq-publication-data). Source data are provided with this paper.
Code availability
All software used in the study is publicly available. Specifically, the custom FIRE pipeline multi-ome-v0.1 (ref. 53; https://doi.org/10.5281/zenodo.12701477 and https://github.com/fiberseq/FIRE) and the custom k-mer-variant-phasing pipeline v0.0.1 (ref. 54; https://doi.org/10.5281/zenodo.10655504 and https://github.com/mrvollger/k-mer-variant-phasing) are publicly available and accessioned on Zenodo.
References
Nurk, S. et al. The complete sequence of a human genome. Science 376, 44–53 (2022).
Merker, J. D. et al. Long-read genome sequencing identifies causal structural variation in a Mendelian disease. Genet. Med. 20, 159–163 (2018).
Hiatt, S. M. et al. Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders. HGG Adv. 2, 100023 (2021).
Cohen, A. S. A. et al. Genomic answers for children: dynamic analyses of >1000 pediatric rare disease genomes. Genet. Med. 24, 1336–1348 (2022).
Aganezov, S. et al. A complete reference genome improves analysis of human genetic variation. Science 376, eabl3533 (2022).
Lunke, S. et al. Integrated multi-omics for rapid rare disease diagnosis on a national scale. Nat. Med. 29, 1681–1691 (2023).
Aref-Eshghi, E. et al. Evaluation of DNA methylation episignatures for diagnosis and phenotype correlations in 42 Mendelian neurodevelopmental disorders. Am. J. Hum. Genet. 106, 356–370 (2020).
Kreitmaier, P., Katsoula, G. & Zeggini, E. Insights from multi-omics integration in complex disease primary tissues. Trends Genet. 39, 46–58 (2023).
Kremer, L. S. et al. Genetic diagnosis of Mendelian disorders via RNA sequencing. Nat. Commun. 8, 15824 (2017).
Cummings, B. B. et al. Improving genetic diagnosis in Mendelian disease with transcriptome sequencing. Sci. Transl. Med. 9, eaal5209 (2017).
Flusberg, B. A. et al. Direct detection of DNA methylation during single-molecule, real-time sequencing. Nat. Methods 7, 461–465 (2010).
Sigurpalsdottir, B. D. et al. A comparison of methods for detecting DNA methylation from long-read sequencing of human genomes. Genome Biol. 25, 69 (2024).
Sanford Kobayashi, E. et al. Approaches to long-read sequencing in a clinical setting to improve diagnostic rate. Sci. Rep. 12, 16945 (2022).
Fukuda, H. et al. Father-to-offspring transmission of extremely long NOTCH2NLC repeat expansions with contractions: genetic and epigenetic profiling with long-read sequencing. Clin. Epigenetics 13, 204 (2021).
Mahmoud, M., Doddapaneni, H., Timp, W. & Sedlazeck, F. J. PRINCESS: comprehensive detection of haplotype resolved SNVs, SVs, and methylation. Genome Biol. 22, 268 (2021).
Stergachis, A. B., Debo, B. M., Haugen, E., Churchman, L. S. & Stamatoyannopoulos, J. A. Single-molecule regulatory architectures captured by chromatin fiber sequencing. Science 368, 1449–1454 (2020).
Abdulhay, N. J. et al. Massively multiplex single-molecule oligonucleosome footprinting. eLife 9, e59404 (2020).
Lee, I. et al. Simultaneous profiling of chromatin accessibility and methylation on human cell lines with nanopore sequencing. Nat. Methods 17, 1191–1199 (2020).
Shipony, Z. et al. Long-range single-molecule mapping of chromatin accessibility in eukaryotes. Nat. Methods 17, 319–327 (2020).
Vollger, M. R. et al. A haplotype-resolved view of human gene regulation. Preprint at bioRxiv https://doi.org/10.1101/2024.06.14.599122 (2024).
Jha, A. et al. DNA-m6A calling and integrated long-read epigenetic and genetic analysis with fibertools. Genome Res. 34, 1976–1986 (2024).
Dubocanin, D. et al. Conservation of chromatin organization within human and primate centromeres. Preprint at bioRxiv https://doi.org/10.1101/2023.04.20.537689 (2023).
Aneichyk, T. et al. Dissecting the causal mechanism of X-linked dystonia-parkinsonism by integrating genome and transcriptome assembly. Cell 172, 897–909 (2018).
Stergachis, A. B. et al. Full-length isoform sequencing for resolving the molecular basis of Charcot–Marie–Tooth 2A. Neurol. Genet. 9, e200090 (2023).
Al’Khafaji, A. M. et al. High-throughput RNA isoform sequencing using programmed cDNA concatenation. Nat. Biotechnol. 42, 582–586 (2024).
1000 Genomes Project Consortium. et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Meuleman, W. et al. Index and biological spectrum of human DNase I hypersensitive sites. Nature 584, 244–251 (2020).
Bache, I., Brondum-Nielsen, K. & Tommerup, N. Genetic counseling in adult carriers of a balanced chromosomal rearrangement ascertained in childhood: experiences from a nationwide reexamination of translocation carriers. Genet. Med. 9, 185–187 (2007).
Giardino, D. et al. De novo balanced chromosome rearrangements in prenatal diagnosis. Prenat. Diagn. 29, 257–265 (2009).
Tsutsumi, M. et al. A female patient with retinoblastoma and severe intellectual disability carrying an X;13 balanced translocation without rearrangement in the RB1 gene: a case report. BMC Med. Genomics 12, 182 (2019).
Repetto, D. et al. Molecular dissection of neurobeachin function at excitatory synapses. Front. Synaptic Neurosci. 10, 28 (2018).
Holness, M. J. & Sugden, M. C. Regulation of pyruvate dehydrogenase complex activity by reversible phosphorylation. Biochem. Soc. Trans. 31, 1143–1151 (2003).
Heinz, S. et al. Transcription elongation can affect genome 3D structure. Cell 174, 1522–1536 (2018).
Ligtenberg, M. J. L. et al. Heritable somatic methylation and inactivation of MSH2 in families with Lynch syndrome due to deletion of the 3′ exons of TACSTD1. Nat. Genet. 41, 112–117 (2009).
Lyon, M. F. X-chromosome inactivation and human genetic disease. Acta Paediatr. Suppl. 91, 107–112 (2002).
Mulhern, M. S. et al. NBEA: developmental disease gene with early generalized epilepsy phenotypes. Ann. Neurol. 84, 788–795 (2018).
Bruel, A. L. et al. Autosomal recessive truncating MAB21L1 mutation associated with a syndromic scrotal agenesis. Clin. Genet. 91, 333–338 (2017).
Rad, A. et al. MAB21L1 loss of function causes a syndromic neurodevelopmental disorder with distinctive cerebellar, ocular, cranio facial and genital features (COFG syndrome). J. Med. Genet. 56, 332–339 (2019).
Kennerson, M. L. et al. A new locus for X-linked dominant Charcot–Marie–Tooth disease (CMTX6) is caused by mutations in the pyruvate dehydrogenase kinase isoenzyme 3 (PDK3) gene. Hum. Mol. Genet. 22, 1404–1416 (2013).
Patel, K. P., O’Brien, T. W., Subramony, S. H., Shuster, J. & Stacpoole, P. W. The spectrum of pyruvate dehydrogenase complex deficiency: clinical, biochemical and genetic features in 371 patients. Mol. Genet. Metab. 105, 34–43 (2012).
Tresenrider, A. et al. Single-cell sequencing of individual retinal organoids reveals determinants of cell-fate heterogeneity. Cell Rep. Methods 3, 100548 (2023).
Dubocanin, D. et al. Single-molecule architecture and heterogeneity of human telomeric DNA and chromatin. Preprint at bioRxiv https://doi.org/10.1101/2022.05.09.491186 (2022).
Poplin, R. et al. A universal SNP and small-indel variant caller using deep neural networks. Nat. Biotechnol. 36, 983 (2018).
Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. & Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods 18, 170–175 (2021).
Akalin, A. et al. methylKit: a comprehensive R package for the analysis of genome-wide DNA methylation profiles. Genome Biol. 13, R87 (2012).
Foox, J. et al. The SEQC2 epigenomics quality control (EpiQC) study. Genome Biol. 22, 332 (2021).
Fondrie, W. E. & Noble, W. S. Machine learning strategy that leverages large data sets to boost statistical power in small-scale experiments. J. Proteome. Res. 19, 1267–1274 (2020).
Akbari, V. et al. Genome-wide detection of imprinted differentially methylated regions using nanopore sequencing. eLife 11, e77898 (2022).
Eldred, K. C. et al. Thyroid hormone signaling specifies cone subtypes in human retinal organoids. Science 362, eaau6348 (2018).
MacEwen, M. J. S. et al. Evolutionary divergence reveals the molecular basis of EMRE dependence of the human MCU. Life Sci. Alliance 3, e202000718 (2020).
Hagege, H. et al. Quantitative analysis of chromosome conformation capture assays (3C-qPCR). Nat. Protoc. 2, 1722–1733 (2007).
Naumova, N., Smith, E. M., Zhan, Y. & Dekker, J. Analysis of long-range chromatin interactions using chromosome conformation capture. Methods 58, 192–203 (2012).
Vollger, M. R. et al. Release version for the multi-ome manuscript (multi-ome-v0.1). Zenodo https://doi.org/10.5281/zenodo.12701477 (2024).
Vollger, M. R. et al. mrvollger/k-mer-variant-phasing: 0.0.1 (0.0.1). Zenodo https://doi.org/10.5281/zenodo.10655527 (2024).
Acknowledgements
We thank C. C. Lambert, A. Souppe, H. Dhillon and S. Zhang (all from PacBio) for assistance with library and sequencing preparations. A.B.S. holds a Career Award for Medical Scientists from the Burroughs Wellcome Fund and is a Pew Biomedical Scholar. This study was supported by the NIH (grants 1DP5OD029630 and 1U01HG010233), in addition to funds from the Collagen Diagnostic Laboratory, University of Washington, and the Brotman Baty Institute for Precision Medicine. Sequencing and data analysis were provided by the University of Washington Center for Rare Disease Research, with support from National Human Genome Research Institute (grants U01 HG011744 and U24 HG011746). Research from the UDN reported in this publication was supported by the National Institute of Neurological Disorders and Stroke of the NIH (awards U01HG010233, U01HG007942 and U01HG007530). The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. M.R.V. was supported by a training grant (T32) from the NIH (2T32GM007454-46). The Eichler Lab provided the CH17-6D16 (chrX) and CH17-139G1 (chr13) BAC clones.
Author information
Authors and Affiliations
Consortia
Contributions
A.B.S., M.R.V. and J.K. conceived the overall method design, coordinated activities from co-authors and wrote the first draft of the manuscript. M.R.V., E.S. and A.B.S. contributed to the analysis of Fiber-seq and MAS-seq data. K.C.E. and T.A.R. contributed to the generation of patient-derived retinal organoids, as well as the evaluation of RNA-seq data in retinal organoids. Y.S. contributed to the western blots, as well as mitochondrial functional studies. Y.M. created the Hia5 enzyme. E.E.B., S. Sheppeard, S. Strohbehn, S.M.S., E.A.R. and A.M.W. contributed to the analysis of genomic data. U.S. and P.H.B. contributed to the Sanger validation of the translocation breakpoints. K.M.D., G.P.J., F.M.H., M.J.B., A.L.Y., J.O., K.A.L., I.G., B.L.D., S.C. and A.A. contributed to patient phenotyping. M.H.-P. contributed to getting IRB approval for this study. J.R., A.B.S., K.C.E., M.K. and Y.H.C. contributed to the generation of Fiber-seq and mRNA samples. J.G.U. contributed to the mRNA concatenation protocol development and transcriptome data analysis. C.T.S. and A.M.W. contributed to CpG methylation calling and IGV visualization. I.J.M. and K.M.M. coordinated and performed the sequencing runs. All authors revised the manuscript.
Corresponding author
Ethics declarations
Competing interests
J.K., J.G.U., C.T.S., A.M.W., M.K. and I.J.M. are full-time employees at PacBio, a company developing single-molecule sequencing technologies. A.B.S. is a co-inventor on a patent relating to the Fiber-seq method (US17/995,058). The remaining authors declare no competing interests.
Peer review
Peer review information
Nature Genetics thanks Sarah Stenton and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 IGV view of integrated long-read multi-ome data.
IGV view showing an example genomic region for GM12878 (from top to bottom) the haplotype-resolved genome, CpG methylome, chromatin and full-length transcript annotations.
Extended Data Fig. 2 Haplotype-specific chromatin architectures.
Volcano plot showing the absolute difference in the percentage of chromatin fibers with chromatin accessibility for each peak genome-wide between the paternal and maternal haplotypes for GM12878, GM20129, HG02630 and GM24385. Peaks are divided into whether they are present on an autosome or the X chromosome, with the exception of GM24385, which is 46,XY. P value was calculated using a two-sided Fisher’s exact test without adjustment for multiple comparisons. Blue dash represents nominal significance line (p < 0.05), and red dash represents the Benjamini–Hochberg FDR correction significance line (FDR < 0.05). Peaks corresponding to known imprinted loci with nominally significant scores are denoted by red crosses.
Extended Data Fig. 3 Identification of haplotype-specific chromatin architectures within HLA locus.
a, View of single-molecule haplotype-resolved genetic information, CpG methylation information and chromatin accessibility information for a heterozygous single-nucleotide variant (SNP) identified for GM12878 within the HLA locus. Note that all information is derived from the same sequencing reads. Denoted below is the ___location of a predicted CTCF binding element, and immediately above it are two fibers that demonstrate single-molecule protein occupancy at this site. b, Quantification of CpG methylation surrounding this SNP (top), as well as the percentage of fibers with a FIRE element overlapping that SNP (bottom), by haplotype. The box bounds the interquartile range (IQR) divided by the median, and the whiskers extend to the minima and maxima of the data. *Top, p value = 0.00077 (one-sided paired t test without adjustment for multiple comparisons, n = 8). *Bottom, p value = 0.00061 (one-sided Fisher’s exact test without adjustment for multiple comparisons).
Extended Data Fig. 4 Haplotype-specific chromatin accessibility of NBEA promoter and transcript NMD.
a, Per-molecule chromatin accessibility of the NBEA promoter along both the der(13) and intact chr13 chromosomes within patient-derived fibroblasts. Chromatin accessibility was measured using Fiber-seq. b, Schematic representation for resolving whether the fusion transcripts are being subjected to nonsense-mediated decay (NMD). Specifically, long-read transcript sequencing was performed on samples before and after treatment with the NMD inhibitor cycloheximide. The libraries were sequenced to similar depths, and the total number of transcripts corresponding to the NBEA–chrX fusion transcripts was quantified in both samples. c, Same as b, but for the PDK3–MAB21L1 fusion transcript.
Extended Data Fig. 5 PDK3 protein levels in patient cells.
a, Western blot of phospho-PDH and PDH in HEK293 cells, as well as HEK293 cells containing a viral expression vector for the intact PDK3 protein, or the PDK3–MAB21L1 fusion protein. b, Western blot of PDK3 in HEK293 cells, as well as HEK293 cells containing a viral expression vector for the intact PDK3 protein, or the PDK3–MAB21L1 fusion protein. * indicates protein bands selectively present in the PDK3-MAB21L1 sample. c, Western blot of PDK3 and actin protein levels in patient-derived fibroblasts, as well as control age-matched fibroblast cultures and retinal organoids. All experiments in the figure were performed once.
Extended Data Fig. 6 Cell-selective expression of MAB21L1.
a, (Top) Bar plot showing the bulk tissue RNA expression of MAB21L1 (ENSG00000180660) from various GTEx samples are displayed by www.proteinatlas.org. (Bottom) Bulk expression of NBEA, MAB21L1 and PDK3 during retinal organoid differentiation as a function of the age of the organoid. TPM, transcripts per million. b, Volcano plot showing the absolute difference in the percentage of chromatin fibers with chromatin accessibility for each peak genome-wide between the paternal and maternal haplotypes for the retinal organoid Fiber-seq data from UDN318336. The MAB21L1 regulatory elements that are disrupted on der(13) are highlighted in red. Blue dash represents the nominal significance line (p < 0.05; two-sided Fisher’s exact test without adjustment for multiple comparisons).
Extended Data Fig. 7 3C chromatin interactions between PDK3 promoter and MAB21L1 enhancer.
Results from chromatin conformation capture (3C) experiment using induced pluripotent stem cells (iPSCs) and retinal organoids from UDN318336. For the iPSC 3C experiment, displayed is the relative enrichment of different regions of the PDK3 gene for interactions with the MAB21L1 enhancer. For the retinal organoid 3C experiment, as a limited amount of 3C library was obtained, displayed are the qualitative results of qPCR for each of the tested interactions.
Extended Data Fig. 8 PDK3 protein levels in patient retinal organoids and PHD activity in iPSCs.
a, Western blot images showing quantification of PDK3, phospho-PDH, PDH and β-actin levels in patient-derived retinal organoids, as well as age-matched retinal organoids from a separate unaffected and unrelated individual. Data from two separate protein extractions are displayed. Quantification of phospho-PDH to PDH level is shown at the bottom. b, Pyruvate dehydrogenase activity assay on iPSCs from UND318336, as well as control iPSCs. Data are presented as mean values ± SEM, and results are from three replicates (n = 3). NS, not significant (p value = 0.11), unpaired two-sided t test.
Extended Data Fig. 9 Comparison with prior chromosome X;13 translocations.
Ideogram showing the translocation breakpoints and derivative chromosomes for this case, as well as a previously published case who similarly had bilateral retinoblastomas. The translocation breakpoints for the previously published case are estimated based on the karyotype.
Extended Data Fig. 10 Allelic imbalance differences along chr13.
a, Swarm plot showing the overall haplotype imbalance in chromatin accessibility along autosomes (except for chromosomes 13 and 14), chromosome X, chromosome 14 and two portions of chromosome 13 within GM12878 and GM24385 cells. P value calculated using a one-sided Mann–Whitney U test without adjustments for multiple comparisons. b, Swarm plot showing the overall haplotype imbalance in transcript production along autosomes (except for chromosomes 13 and 14), chromosome X, chromosome 14 and two portions of chromosome 13 within UDN318336 and GM12878 cells. P value calculated using a one-sided Mann–Whitney U test without adjustments for multiple comparisons. c, Swarm plot showing the haplotype imbalance in chromatin accessibility at different regions of chromosome 13 in fibroblasts from UDN318336. P value calculated using a one-sided Mann–Whitney U test without adjustments for multiple comparisons.
Supplementary information
Supplementary Information
Supplementary Tables 1–3.
Supplementary Table 4
Complete lists of consortia members.
Source data
Source Data Fig. 3
Statistical source data.
Source Data Fig. 4
Statistical source data.
Source Data Fig. 4 and Extended Data Figs. 5 and 8
Unprocessed western blots and/or gels.
Source Data Fig. 5
Statistical source data.
Source Data Extended Data Fig. 2
Statistical source data.
Source Data Extended Data Fig. 3
Statistical source data.
Source Data Extended Data Fig. 4
Statistical source data.
Source Data Extended Data Fig. 6
Statistical source data.
Source Data Extended Data Fig. 7
Statistical source data.
Source Data Extended Data Fig. 8
Statistical source data.
Source Data Extended Data Fig. 10
Statistical source data.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Vollger, M.R., Korlach, J., Eldred, K.C. et al. Synchronized long-read genome, methylome, epigenome and transcriptome profiling resolve a Mendelian condition. Nat Genet 57, 469–479 (2025). https://doi.org/10.1038/s41588-024-02067-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41588-024-02067-0