Genome-wide colocalization of RNA-DNA interactions and fusion RNA pairs

Proc Natl Acad Sci U S A. 2019 Feb 19;116(8):3328-3337. doi: 10.1073/pnas.1819788116. Epub 2019 Feb 4.

Abstract

Fusion transcripts are used as biomarkers in companion diagnostics. Although more than 15,000 fusion RNAs have been identified from diverse cancer types, few common features have been reported. Here, we compared 16,410 fusion transcripts detected in cancer (from a published cohort of 9,966 tumor samples of 33 cancer types) with genome-wide RNA-DNA interactions mapped in two normal, noncancerous cell types [using iMARGI, an enhanced version of the mapping of RNA-genome interactions (MARGI) assay]. Among the top 10 most significant RNA-DNA interactions in normal cells, 5 colocalized with the gene pairs that formed fusion RNAs in cancer. Furthermore, throughout the genome, the frequency with which a gene pair exhibits RNA-DNA interactions is positively correlated with the probability that the same gene pair has documented fusion transcripts in cancer. To test whether RNA-DNA interactions in normal cells are predictive of fusion RNAs, we identified fusion transcripts in a validation cohort of 96 lung cancer samples using RNA sequencing (RNA-seq). Thirty-seven of the 42 fusion transcripts in the validation cohort were found to exhibit RNA-DNA interactions in normal cells. Finally, by combining RNA-seq, single-molecule RNA FISH, and DNA FISH, we detected a cancer sample that contained EML4-ALK fusion RNA without having formed the EML4-ALK fusion gene. Collectively, these data suggest an RNA-poise model, in which spatial proximity of RNA and DNA could poise a gene pair for the creation of fusion transcripts.
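The comparison described in the abstract (how many fusion gene pairs also show an RNA-DNA interaction) can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the treatment of gene pairs as unordered, the hypergeometric enrichment test, and the placeholder gene names are all assumptions made for illustration only.

```python
from math import comb


def normalize(pair):
    """Treat a gene pair as unordered (assumption): (A, B) equals (B, A)."""
    return tuple(sorted(pair))


def overlap_fraction(fusion_pairs, interaction_pairs):
    """Fraction of fusion gene pairs that also appear among interacting pairs."""
    fusions = {normalize(p) for p in fusion_pairs}
    interactions = {normalize(p) for p in interaction_pairs}
    if not fusions:
        return 0.0
    return len(fusions & interactions) / len(fusions)


def hypergeom_tail(k, n_universe, n_interacting, n_fusions):
    """P(X >= k) when n_fusions pairs are drawn without replacement from
    n_universe candidate pairs, n_interacting of which show an interaction.
    A generic enrichment test, swapped in here as an assumption."""
    total = comb(n_universe, n_fusions)
    upper = min(n_interacting, n_fusions)
    hits = sum(
        comb(n_interacting, i) * comb(n_universe - n_interacting, n_fusions - i)
        for i in range(k, upper + 1)
    )
    return hits / total


# Toy, made-up gene pairs (placeholders, not data from the study).
toy_fusions = [("GENE_A", "GENE_B"), ("GENE_C", "GENE_D"), ("GENE_E", "GENE_F")]
toy_interactions = [("GENE_B", "GENE_A"), ("GENE_C", "GENE_D"), ("GENE_X", "GENE_Y")]

print(overlap_fraction(toy_fusions, toy_interactions))
print(hypergeom_tail(2, n_universe=10, n_interacting=3, n_fusions=3))
```

Applied to the counts reported above, 37 of 42 validation-cohort fusion transcripts corresponds to an overlap of about 88%. Whether that overlap is statistically enriched depends on the universe of candidate gene pairs, which the abstract does not specify.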

Keywords: RNA-poise model; RNA–DNA interactions; fusion transcripts.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • DNA / genetics*
  • Genome, Human / genetics*
  • Humans
  • In Situ Hybridization, Fluorescence
  • Lung Neoplasms / genetics
  • Lung Neoplasms / pathology
  • Neoplasms / genetics
  • Neoplasms / pathology
  • Oncogene Proteins, Fusion / genetics*
  • RNA / genetics*
  • Sequence Analysis, RNA

Substances

  • EML4-ALK fusion protein, human
  • Oncogene Proteins, Fusion
  • RNA
  • DNA