Predicting failure rate of PCR in large genomes

Nucleic Acids Res. 2008 Jun;36(11):e66. doi: 10.1093/nar/gkn290. Epub 2008 May 20.

Abstract

We have developed statistical models for estimating the failure rate of polymerase chain reaction (PCR) primers using 236 primer sequence-related factors. The model involved 1314 primer pairs and is based on more than 80 000 PCR experiments. We found that the most important factor in determining PCR failure is the number of predicted primer-binding sites in the genomic DNA. We also compared different ways of defining primer-binding sites (fixed length word versus thermodynamic model; exact match versus matches including 1-2 mismatches). We found that the most efficient prediction of PCR failure rates can be achieved using a combination of four factors (number of primer-binding sites counted in different ways plus GC% of the primer) combined into single statistical model GM1. According to our estimations from experimental data, the GM1 model can reduce the average failure rate of PCR primers nearly 3-fold (from 17% to 6%). The GM1 model can easily be implemented in software to premask genome sequences for potentially failing PCR primers, thus improving large-scale PCR-primer design.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • DNA Primers / chemistry*
  • Genome, Human
  • Genomics*
  • Humans
  • Models, Statistical*
  • Polymerase Chain Reaction / methods*
  • Software

Substances

  • DNA Primers