Current advances in comprehensive omics data mining for oncology and cancer research

Biochim Biophys Acta Rev Cancer. 2024 Jan;1879(1):189030. doi: 10.1016/j.bbcan.2023.189030. Epub 2023 Nov 25.

Abstract

The availability of large amounts of multiomics data enables data-driven discovery studies in cancer. High-throughput data on mutations, gene/protein expression, immune scores (tumor-infiltrating cells), drug screening, and loss-of-function screening (shRNA-based RNAi and CRISPR) are the major components integrated in patient-sample and cell-line datasets. Improvements in data access and user interfaces make it easy for general scientists to mine integrated multiomics data platforms without computational expertise. Here, we summarize the extent of data integration and the functionality of several portals and software packages that provide integrated multiomics data mining platforms for cancer studies. Recent progress includes application programming interfaces (APIs) for customized data mining. Precalculated datasets help noncomputational users browse data associations quickly. Furthermore, stand-alone software provides fast calculations and smart functions that guide users toward optimal sampling and filtering options, making significant data associations easier to discover. These efforts improve the utility of cancer omics big data for noncomputational users at all levels of cancer research. In the present review, we aim to provide analytical information that guides general scientists in finding and using data mining tools for their research.

Keywords: Biomarkers; Data mining; Drug synergy; Prognostic markers; Smart software; Target discovery.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Mining
  • Humans
  • Medical Oncology
  • Neoplasms* / genetics
  • Proteomics*
  • Software