TAGCNA: a method to identify significant consensus events of copy number alterations in cancer

PLoS One. 2012;7(7):e41082. doi: 10.1371/journal.pone.0041082. Epub 2012 Jul 18.

Abstract

Somatic copy number alteration (CNA) is a common phenomenon in cancer genome. Distinguishing significant consensus events (SCEs) from random background CNAs in a set of subjects has been proven to be a valuable tool to study cancer. In order to identify SCEs with an acceptable type I error rate, better computational approaches should be developed based on reasonable statistics and null distributions. In this article, we propose a new approach named TAGCNA for identifying SCEs in somatic CNAs that may encompass cancer driver genes. TAGCNA employs a peel-off permutation scheme to generate a reasonable null distribution based on a prior step of selecting tag CNA markers from the genome being considered. We demonstrate the statistical power of TAGCNA on simulated ground truth data, and validate its applicability using two publicly available cancer datasets: lung and prostate adenocarcinoma. TAGCNA identifies SCEs that are known to be involved with proto-oncogenes (e.g. EGFR, CDK4) and tumor suppressor genes (e.g. CDKN2A, CDKN2B), and provides many additional SCEs with potential biological relevance in these data. TAGCNA can be used to analyze the significance of CNAs in various cancers. It is implemented in R and is freely available at http://tagcna.sourceforge.net/.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenocarcinoma / genetics*
  • Algorithms
  • Computer Simulation
  • Databases, Genetic
  • Gene Deletion
  • Gene Dosage*
  • Genome
  • Genome, Human
  • Humans
  • Lung Neoplasms / genetics*
  • Male
  • Models, Genetic
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis / methods
  • Prostatic Neoplasms / genetics*
  • Reproducibility of Results