Inflammatory breast cancer biomarker identification by simultaneous TGIRT-seq profiling of coding and non-coding RNAs in tumors and blood

medRxiv [Preprint]. 2023 May 30:2023.05.26.23290469. doi: 10.1101/2023.05.26.23290469.

Abstract

Inflammatory breast cancer (IBC) is the most aggressive and lethal breast cancer subtype, but lags in biomarker identification. Here, we used an improved Thermostable Group II Intron Reverse Transcriptase RNA sequencing (TGIRT-seq) method to simultaneously profile coding and non-coding RNAs from tumors, PBMCs, and plasma of IBC and non-IBC patients and healthy donors. Besides RNAs from known IBC-relevant genes, we identified hundreds of other overexpressed coding and non-coding RNAs (p≤0.001) in IBC tumors and PBMCs, including higher proportions with elevated intron-exon depth ratios (IDRs), likely reflecting enhanced transcription resulting in accumulation of intronic RNAs. As a consequence, differentially represented protein-coding gene RNAs in IBC plasma were largely intron RNA fragments, whereas those in healthy donor and non-IBC plasma were largely fragmented mRNAs. Potential IBC biomarkers in plasma included T-cell receptor pre-mRNA fragments traced to IBC tumors and PBMCs; intron RNA fragments correlated with high IDR genes; and LINE-1 and other retroelement RNAs that we found globally up-regulated in IBC and preferentially enriched in plasma. Our findings provide new insights into IBC and demonstrate advantages of broadly analyzing transcriptomes for biomarker identification. The RNA-seq and data analysis methods developed for this study may be broadly applicable to other diseases.

Keywords: LINE-1; RNA splicing; T cell receptor; cancer; diagnostics; high-throughput RNA sequencing; hypertranscription; inflammatory response; intron RNA; plasma RNAs; retroelements; reverse transcriptase.

Publication types

  • Preprint