Discovery of a polymorphic gene fusion via bottom-up chimeric RNA prediction

Nucleic Acids Res. 2024 May 8;52(8):4409-4421. doi: 10.1093/nar/gkae258.

Abstract

Gene fusions and their chimeric products are commonly linked with cancer. However, recent studies have found chimeric transcripts in non-cancer tissues and cell lines. Large-scale efforts to annotate structural variations have identified gene fusions capable of generating chimeric transcripts even in normal tissues. In this study, we present a bottom-up approach targeting population-specific chimeric RNAs, identifying 58 such instances in the GTEx cohort, including notable cases such as SUZ12P1-CRLF3, TFG-ADGRG7 and TRPM4-PPFIA3, which possess distinct patterns across different ancestry groups. We provide direct evidence for an additional 29 polymorphic chimeric RNAs with associated structural variants, revealing 13 novel rare structural variants. Additionally, we utilize the All of Us dataset and a large cohort of clinical samples to characterize the association of the SUZ12P1-CRLF3-causing variant with patient phenotypes. Our study showcases SUZ12P1-CRLF3 as a representative example, illustrating the identification of elusive structural variants by focusing on those producing population-specific fusion transcripts.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Extramural

MeSH terms

  • Gene Fusion*
  • Humans
  • Neoplasm Proteins / genetics
  • Neoplasms / genetics
  • Oncogene Proteins, Fusion / genetics
  • Polycomb Repressive Complex 2 / genetics
  • Polycomb Repressive Complex 2 / metabolism
  • Polymorphism, Genetic
  • RNA / genetics
  • TRPM Cation Channels / genetics
  • Transcription Factors / genetics
  • Transcription Factors / metabolism

Substances

  • Neoplasm Proteins
  • Oncogene Proteins, Fusion
  • SUZ12 protein, human
  • Polycomb Repressive Complex 2
  • RNA
  • Transcription Factors
  • TRPM Cation Channels