Integration of genome scale data for identifying new players in colorectal cancer

World J Gastroenterol. 2016 Jan 14;22(2):534-45. doi: 10.3748/wjg.v22.i2.534.

Abstract

Colorectal cancers (CRCs) display a wide variety of genomic aberrations that may be either causally linked to their development and progression, or might serve as biomarkers for their presence. Recent advances in rapid high-throughput genetic and genomic analysis have helped to identify a plethora of alterations that can potentially serve as new cancer biomarkers, and thus help to improve CRC diagnosis, prognosis, and treatment. Each distinct data type (copy number variations, gene and microRNAs expression, CpG island methylation) provides an investigator with a different, partially independent, and complementary view of the entire genome. However, elucidation of gene function will require more information than can be provided by analyzing a single type of data. The integration of knowledge obtained from different sources is becoming increasingly essential for obtaining an interdisciplinary view of large amounts of information, and also for cross-validating experimental results. The integration of numerous types of genetic and genomic data derived from public sources, and via the use of ad-hoc bioinformatics tools and statistical methods facilitates the discovery and validation of novel, informative biomarkers. This combinatory approach will also enable researchers to more accurately and comprehensively understand the associations between different biologic pathways, mechanisms, and phenomena, and gain new insights into the etiology of CRC.

Keywords: Colorectal cancer; Copy number variations; Data integration; Gene expression; Methylome; miRNA expression.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Animals
  • Biomarkers, Tumor / genetics*
  • Biomarkers, Tumor / metabolism
  • Colorectal Neoplasms / genetics*
  • Colorectal Neoplasms / metabolism
  • Colorectal Neoplasms / pathology
  • Computational Biology*
  • DNA Copy Number Variations
  • DNA Methylation
  • Databases, Genetic
  • Epigenesis, Genetic
  • Gene Dosage
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic
  • Genetic Predisposition to Disease
  • Genomics / methods*
  • High-Throughput Nucleotide Sequencing
  • Humans
  • MicroRNAs / genetics
  • MicroRNAs / metabolism
  • Phenotype
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Signal Transduction
  • Systems Integration*

Substances

  • Biomarkers, Tumor
  • MicroRNAs
  • RNA, Messenger