Spatial transformation of multi-omics data unlocks novel insights into cancer biology

Elife. 2023 Sep 5:12:RP87133. doi: 10.7554/eLife.87133.

Abstract

The application of next-generation sequencing (NGS) has transformed cancer research. As costs have decreased, NGS has increasingly been applied to generate multiple layers of molecular data from the same samples, covering genomics, transcriptomics, and methylomics. Integrating these types of multi-omics data in a combined analysis is now becoming a common issue with no obvious solution, often handled on an ad hoc basis, with multi-omics data arriving in a tabular format and analyzed using computationally intensive statistical methods. These methods particularly ignore the spatial orientation of the genome and often apply stringent p-value corrections that likely result in the loss of true positive associations. Here, we present GENIUS (GEnome traNsformatIon and spatial representation of mUltiomicS data), a framework for integrating multi-omics data using deep learning models developed for advanced image analysis. The GENIUS framework is able to transform multi-omics data into images with genes displayed as spatially connected pixels and successfully extract relevant information with respect to the desired output. We demonstrate the utility of GENIUS by applying the framework to multi-omics datasets from the Cancer Genome Atlas. Our results are focused on predicting the development of metastatic cancer from primary tumors, and demonstrate how through model inference, we are able to extract the genes which are driving the model prediction and are likely associated with metastatic disease progression. We anticipate our framework to be a starting point and strong proof of concept for multi-omics data transformation and analysis without the need for statistical correction.

Keywords: bladder cancer; cancer; cancer biology; computational biology; deep learning; human; immunotherapy; integrated gradients; multi-omics; systems biology.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Gene Expression Profiling
  • Genomics
  • High-Throughput Nucleotide Sequencing
  • Image Processing, Computer-Assisted
  • Multiomics*
  • Neoplasms*

Associated data

  • dbGaP/phs000178.v11.p8

Grants and funding

The funders had no role in study design, data collection, and interpretation, or the decision to submit the work for publication.