Predicting failure rate of PCR in large genomes

Reidar Andreson; Tõnu Möls; Maido Remm

doi:10.1093/nar/gkn290

Predicting failure rate of PCR in large genomes

Nucleic Acids Res. 2008 Jun;36(11):e66. doi: 10.1093/nar/gkn290. Epub 2008 May 20.

Authors

Reidar Andreson¹, Tõnu Möls, Maido Remm

Affiliation

¹ Department of Bioinformatics, Institute of Molecular and Cell Biology, University of Tartu and Estonian Biocentre, Tartu, Estonia.

Abstract

We have developed statistical models for estimating the failure rate of polymerase chain reaction (PCR) primers using 236 primer sequence-related factors. The model involved 1314 primer pairs and is based on more than 80 000 PCR experiments. We found that the most important factor in determining PCR failure is the number of predicted primer-binding sites in the genomic DNA. We also compared different ways of defining primer-binding sites (fixed length word versus thermodynamic model; exact match versus matches including 1-2 mismatches). We found that the most efficient prediction of PCR failure rates can be achieved using a combination of four factors (number of primer-binding sites counted in different ways plus GC% of the primer) combined into single statistical model GM1. According to our estimations from experimental data, the GM1 model can reduce the average failure rate of PCR primers nearly 3-fold (from 17% to 6%). The GM1 model can easily be implemented in software to premask genome sequences for potentially failing PCR primers, thus improving large-scale PCR-primer design.

Publication types

Evaluation Study
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
DNA Primers / chemistry*
Genome, Human
Genomics*
Humans
Models, Statistical*
Polymerase Chain Reaction / methods*
Software

Substances

DNA Primers