Single-molecule, full-length transcript isoform sequencing reveals disease-associated RNA isoforms in cardiomyocytes

Nat Commun. 2021 Jul 9;12(1):4203. doi: 10.1038/s41467-021-24484-z.

Abstract

Alternative splicing generates differing RNA isoforms that govern the phenotypic complexity of eukaryotes. Its malfunction underlies many diseases, including cancer and cardiovascular diseases. Comparative analysis of RNA isoforms at the genome-wide scale has been difficult. Here, we establish an experimental and computational pipeline that performs de novo transcript annotation and accurately quantifies transcript isoforms from cDNA sequences with a full-length isoform detection accuracy of 97.6%. We generate a searchable, quantitative human transcriptome annotation with 31,025 known and 5,740 novel transcript isoforms (http://steinmetzlab.embl.de/iBrowser/). By analyzing the isoforms in the presence of RNA Binding Motif Protein 20 (RBM20) mutations associated with aggressive dilated cardiomyopathy (DCM), we identify 121 differentially expressed transcript isoforms in 107 cardiac genes. Our approach enables quantitative dissection of complex transcript architecture instead of mere identification of inclusion or exclusion of individual exons, as exemplified by the discovery of IMMT isoforms mis-spliced by RBM20 mutations. We thereby establish a path to direct differential expression testing independent of an existing annotation of transcript isoforms, providing more immediate biological interpretation and higher-resolution transcriptome comparisons.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing*
  • CRISPR-Cas Systems / genetics
  • Cardiomyopathy, Dilated / genetics*
  • Cardiomyopathy, Dilated / pathology
  • Cell Differentiation / genetics
  • Cell Line
  • Feasibility Studies
  • Gene Editing
  • Humans
  • Induced Pluripotent Stem Cells
  • Mitochondrial Proteins / genetics
  • Molecular Sequence Annotation
  • Muscle Proteins / genetics
  • Mutation
  • Myocytes, Cardiac / pathology*
  • RNA Isoforms / genetics
  • RNA, Guide, CRISPR-Cas Systems
  • RNA-Binding Proteins / genetics*
  • RNA-Seq / methods*

Substances

  • IMMT protein, human
  • Mitochondrial Proteins
  • Muscle Proteins
  • RNA Isoforms
  • RNA-Binding Proteins
  • ribonucleic acid binding motif protein 20, human