MSMAD: a computationally efficient method for the analysis of noisy array CGH data

Bioinformatics. 2009 Mar 15;25(6):703-13. doi: 10.1093/bioinformatics/btp022. Epub 2009 Jan 15.

Abstract

Motivation: Genome analysis has become one of the most important tools for understanding the complex process of cancerogenesis. With increasing resolution of CGH arrays, the demand for computationally efficient algorithms arises, which are effective in the detection of aberrations even in very noisy data.

Results: We developed a rather simple, non-parametric technique of high computational efficiency for CGH array analysis that adopts a median absolute deviation concept for breakpoint detection, comprising median smoothing for pre-processing. The resulting algorithm has the potential to outperform any single smoothing approach as well as several recently proposed segmentation techniques. We show its performance through the application of simulated and real datasets in comparison to three other methods for array CGH analysis.

Implementation: Our approach is implemented in the R-language and environment for statistical computing (version 2.6.1 for Windows, R-project, 2007). The code is available at: http://www.iba.muni.cz/~budinska/msmad.html.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Comparative Genomic Hybridization / methods*
  • Computational Biology / methods*
  • Oligonucleotide Array Sequence Analysis / methods
  • Programming Languages