Discovery of A Polymorphic Gene Fusion via Bottom-Up Chimeric RNA Prediction

bioRxiv [Preprint]. 2023 Feb 3:2023.02.02.526864. doi: 10.1101/2023.02.02.526864.

Abstract

Gene fusions and their chimeric products are typically considered hallmarks of cancer. However, recent studies have found chimeric transcripts in non-cancer tissues and cell lines. In addition, efforts to annotate structural variation at large scale have found examples of gene fusions with the potential to produce chimeric transcripts in normal tissues. In this report, we provide a means for targeting population-specific chimeric RNAs to enrich for those generated by gene fusion events. We identify 57 such chimeric RNAs from the GTEx cohort, including SUZ12P1-CRLF3 and TFG-ADGRG7, whose distribution we assessed across the populations of the 1000 Genomes Project. We reveal that SUZ12P1-CRLF3 results from a common complex structural variant in populations with African heritage, and identify its likely mechanism of formation. Additionally, we utilize a large cohort of clinical samples to characterize the SUZ12P1-CRLF3 chimeric RNA, and find an association between the variant and indications of Neurofibromatosis Type I. We present this gene fusion as a case study for identifying hard-to-find and potentially functional structural variants by selecting for those which produce population-specific fusion transcripts.

Key points:

  • Discovery of 57 polymorphic chimeric RNAs
  • Characterization of the SUZ12P1-CRLF3 polymorphic chimeric RNA and corresponding rearrangement
  • Novel bottom-up approach to identify structural variants which produce transcribed gene fusions
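
The idea of selecting for population-specific chimeric RNAs can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name, the input layout (per-sample chimeric RNA calls plus a sample-to-population map), the frequency thresholds, and the toy identifiers `GENE_A-GENE_B`, `POP_1`, and so on are all assumptions made for illustration. The sketch keeps a chimera only if it is recurrent in one population and rare in every other one, which is one plausible way to enrich for transcripts arising from an inherited rearrangement.

```python
from collections import Counter, defaultdict


def population_specific_chimeras(calls, sample_pop, min_freq=0.05, max_other_freq=0.01):
    """Return {chimera: (population, carrier_frequency)} for chimeras that are
    common in exactly one population and rare in all others.

    calls      -- iterable of (sample_id, chimera_name) pairs (hypothetical layout)
    sample_pop -- dict mapping sample_id to a population label
    Thresholds are illustrative defaults, not values from the paper.
    """
    pop_sizes = Counter(sample_pop.values())

    # Track distinct carrier samples so repeated calls in one sample count once.
    carriers = defaultdict(set)
    for sample_id, chimera in calls:
        if sample_id in sample_pop:
            carriers[chimera].add(sample_id)

    hits = {}
    for chimera, samples in carriers.items():
        counts = Counter(sample_pop[s] for s in samples)
        # Carrier frequency within each population, not share of all carriers,
        # so an unbalanced cohort does not dominate the result.
        freqs = {pop: counts[pop] / n for pop, n in pop_sizes.items()}
        top_pop = max(freqs, key=freqs.get)
        others = [f for pop, f in freqs.items() if pop != top_pop]
        if freqs[top_pop] >= min_freq and all(f <= max_other_freq for f in others):
            hits[chimera] = (top_pop, freqs[top_pop])
    return hits


# Toy data: four samples per hypothetical population.
sample_pop = {
    "s1": "POP_1", "s2": "POP_1", "s3": "POP_1", "s4": "POP_1",
    "s5": "POP_2", "s6": "POP_2", "s7": "POP_2", "s8": "POP_2",
}
calls = [
    ("s1", "GENE_A-GENE_B"), ("s2", "GENE_A-GENE_B"), ("s2", "GENE_A-GENE_B"),
    ("s1", "GENE_C-GENE_D"), ("s5", "GENE_C-GENE_D"), ("s6", "GENE_C-GENE_D"),
]
result = population_specific_chimeras(calls, sample_pop)
print(result)
```

In this toy input, `GENE_A-GENE_B` is seen only in `POP_1` samples and is retained, while `GENE_C-GENE_D` appears in both populations and is filtered out. A real analysis would also need to account for expression level, tissue, and sequencing depth, none of which this sketch models.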

Publication types

  • Preprint