CNGPLD: case-control copy-number analysis using Gaussian process latent difference

Bioinformatics. 2022 Apr 12;38(8):2096-2101. doi: 10.1093/bioinformatics/btac096.

Abstract

Motivation: Cross-sectional analyses of primary cancer genomes have identified regions of recurrent somatic copy-number alteration, many of which result from positive selection during cancer formation and contain driver genes. However, no effective approach exists for identifying genomic loci under significantly different degrees of selection in cancers of different subtypes, anatomic sites or disease stages.

Results: CNGPLD is a new tool for performing case-control somatic copy-number analysis that facilitates the discovery of differentially amplified or deleted copy-number aberrations in a case group of cancer compared with a control group of cancer. This tool uses a Gaussian process statistical framework in order to account for the covariance structure of copy-number data along genomic coordinates and to control the false discovery rate at the region level.

Availability and implementation: CNGPLD is freely available at https://bitbucket.org/djhshih/cngpld as an R package.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Case-Control Studies
  • Cross-Sectional Studies
  • DNA Copy Number Variations
  • Genome*
  • Genomics
  • Humans
  • Neoplasms* / genetics
  • Software