Performance assessment of copy number microarray platforms using a spike-in experiment

Bioinformatics. 2011 Apr 15;27(8):1052-60. doi: 10.1093/bioinformatics/btr106.

Abstract

Motivation: Changes in the copy number of chromosomal DNA segments [copy number variants (CNVs)] have been implicated in human variation, heritable diseases and cancers. Microarray-based platforms are the current established technology of choice for studies reporting these discoveries and constitute the benchmark against which emergent sequence-based approaches will be evaluated. Research that depends on CNV analysis is rapidly increasing, and systematic platform assessments that distinguish strengths and weaknesses are needed to guide informed choice.

Results: We evaluated the sensitivity and specificity of six platforms, provided by four leading vendors, using a spike-in experiment. NimbleGen and Agilent platforms outperformed Illumina and Affymetrix in accuracy and precision of copy number dosage estimates. However, Illumina and Affymetrix algorithms that leverage single nucleotide polymorphism (SNP) information make up for this disadvantage and perform well at variant detection. Overall, the NimbleGen 2.1M platform outperformed others, but only with the use of an alternative data analysis pipeline to the one offered by the manufacturer.

Availability: The data is available from http://rafalab.jhsph.edu/cnvcomp/.

Contact: pevsner@jhmi.edu; fspencer@jhmi.edu; rafa@jhu.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • DNA Copy Number Variations*
  • Female
  • Humans
  • Male
  • Oligonucleotide Array Sequence Analysis / methods*
  • Polymorphism, Single Nucleotide
  • Sensitivity and Specificity