CancerSubtypes: an R/Bioconductor package for molecular cancer subtype identification, validation and visualization

Bioinformatics. 2017 Oct 1;33(19):3131-3133. doi: 10.1093/bioinformatics/btx378.

Abstract

Summary: Identifying molecular cancer subtypes from multi-omics data is an important step in personalized medicine. We introduce CancerSubtypes, an R package for identifying cancer subtypes using multi-omics data, including gene expression, miRNA expression and DNA methylation data. CancerSubtypes integrates four main computational methods that are highly cited for cancer subtype identification and provides a standardized framework for data pre-processing, feature selection, and follow-up analyses of the results, including result computation, biological validation and visualization. The input and output of each step in the framework are packaged in the same data format, making it convenient to compare different methods. The package is useful for inferring cancer subtypes from an input genomic dataset, comparing the predictions from different well-known methods and testing new subtype discovery methods, as shown with different application scenarios in the Supplementary Material.
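
The workflow summarized above (feature selection, subtype assignment by more than one method, and comparison through a shared result format) can be illustrated with a minimal Python sketch. It does not use or reproduce the CancerSubtypes API: the function names, the toy data, the two toy grouping methods and the Rand-index comparison are all assumptions chosen for illustration only.

```python
import random
from itertools import combinations
from statistics import pvariance


def select_by_variance(features, top_n):
    """Keep the top_n feature rows with the highest variance across samples.

    `features` is a list of rows: one row per feature, one column per sample.
    """
    return sorted(features, key=pvariance, reverse=True)[:top_n]


def kmeans_method(features, k, seed=0, iterations=20):
    """Toy k-means over samples; returns the shared result format."""
    samples = list(zip(*features))  # one tuple of feature values per sample
    rng = random.Random(seed)
    centers = rng.sample(samples, k)
    groups = [0] * len(samples)
    for _ in range(iterations):
        # Assign every sample to its nearest center (squared Euclidean distance).
        groups = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(s, centers[c])))
            for s in samples
        ]
        # Move each non-empty center to the mean of its members.
        for c in range(k):
            members = [s for s, g in zip(samples, groups) if g == c]
            if members:
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return {"method": "kmeans", "group": groups}


def median_split_method(features):
    """Toy baseline: split samples at the upper median of the first feature."""
    row = features[0]
    cut = sorted(row)[len(row) // 2]
    return {"method": "median_split", "group": [int(v >= cut) for v in row]}


def rand_index(result_a, result_b):
    """Fraction of sample pairs on which two results agree (same vs. different group)."""
    a, b = result_a["group"], result_b["group"]
    pairs = list(combinations(range(len(a)), 2))
    agree = sum((a[i] == a[j]) == (b[i] == b[j]) for i, j in pairs)
    return agree / len(pairs)


# Hypothetical toy matrix: 3 features (rows) x 6 samples (columns).
toy = [
    [1.0, 1.1, 0.9, 5.0, 5.2, 4.9],
    [2.0, 2.0, 2.1, 2.0, 1.9, 2.0],
    [0.2, 0.1, 0.3, 3.0, 3.1, 2.9],
]
selected = select_by_variance(toy, top_n=2)
res_kmeans = kmeans_method(selected, k=2)
res_split = median_split_method(selected)
print(res_kmeans["method"], res_split["method"], rand_index(res_kmeans, res_split))
```

Because both toy methods return the same dictionary shape (a method name plus one group label per sample), the comparison function needs no method-specific handling, which is the benefit of a common data format that the summary describes.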

Availability and implementation: The package is implemented in R and is available under the GPL-2 license from the Bioconductor website (http://bioconductor.org/packages/CancerSubtypes/).

Contact: thuc.le@unisa.edu.au or jiuyong.li@unisa.edu.au.

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Computer Graphics
  • DNA Methylation
  • Gene Expression
  • Genomics
  • Humans
  • MicroRNAs / metabolism
  • Neoplasms / classification*
  • Neoplasms / genetics*
  • Neoplasms / metabolism
  • Software*

Substances

  • MicroRNAs