Comprehensive RNA dataset of tissue and plasma from patients with esophageal cancer or precursor lesions

Sci Data. 2022 Mar 14;9(1):86. doi: 10.1038/s41597-022-01176-x.

Abstract

In the past decades, the incidence of esophageal adenocarcinoma has increased dramatically in Western populations. A better understanding of disease etiology and the identification of novel prognostic and predictive biomarkers are urgently needed to improve the dismal survival probabilities. Here, we performed comprehensive RNA (coding and non-coding) profiling in various samples from 17 patients diagnosed with esophageal adenocarcinoma, high-grade dysplastic Barrett's esophagus, or non-dysplastic Barrett's esophagus. Per patient, a blood plasma sample, a healthy esophageal tissue sample, and a diseased esophageal tissue sample were included. In total, this comprehensive dataset consists of 102 sequenced libraries from 51 samples. Based on these data, 119 expression profiles are available for three biotypes: miRNA (51), mRNA (51), and circRNA (17). This unique resource allows for the discovery of novel biomarkers and disease mechanisms, the comparison of tissue and liquid biopsy profiles, and the integration of coding and non-coding RNA patterns, and it can serve as a validation dataset in other RNA landscaping studies. Moreover, structural RNA differences can be identified in this dataset, including protein-coding mutations, fusion genes, and circular RNAs.
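
The counts reported in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and the number of libraries per sample is derived from the stated totals rather than given in the abstract.

```python
# Counts as stated in the abstract.
patients = 17
samples_per_patient = 3  # plasma, healthy tissue, diseased tissue
libraries = 102
profiles = {"miRNA": 51, "mRNA": 51, "circRNA": 17}

# Derived quantities (not stated directly in the abstract).
samples = patients * samples_per_patient
libraries_per_sample, remainder = divmod(libraries, samples)
total_profiles = sum(profiles.values())

print(f"samples: {samples}")
print(f"libraries per sample: {libraries_per_sample} (remainder {remainder})")
print(f"expression profiles: {total_profiles}")
```

The derived values agree with the reported totals of 51 samples and 119 expression profiles, and the 102 libraries divide evenly over the samples.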

Publication types

  • Dataset
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenocarcinoma* / blood
  • Adenocarcinoma* / genetics
  • Barrett Esophagus* / blood
  • Barrett Esophagus* / genetics
  • Biomarkers
  • Disease Progression
  • Esophageal Neoplasms* / blood
  • Esophageal Neoplasms* / genetics
  • Humans
  • MicroRNAs* / genetics
  • Plasma / metabolism

Substances

  • Biomarkers
  • MicroRNAs

Supplementary concepts

  • Adenocarcinoma Of Esophagus