Majority of divergence between closely related DNA samples is due to indels

Proc Natl Acad Sci U S A. 2003 Apr 15;100(8):4661-5. doi: 10.1073/pnas.0330964100. Epub 2003 Apr 2.

Abstract

It was recently shown that indels are responsible for more than twice as many unmatched nucleotides as are base substitutions between samples of chimpanzee and human DNA. A larger sample has now been examined and the result is similar. The number of indels is approximately 1/12th of the number of base substitutions and the average length of the indels is 36 nt, including indels up to 10 kb. The ratio (R(u)) of unpaired nucleotides attributable to indels to those attributable to substitutions is 3.0 for this 2 million-nt chimp DNA sample compared with human. There is similar evidence of a large value of R(u) for sea urchins from the polymorphism of a sample of Strongylocentrotus purpuratus DNA (R(u) = 3-4). Other work indicates that similarly, per nucleotide affected, large differences are seen for indels in the DNA polymorphism of the plant Arabidopsis thaliana (R(u) = 51). For the insect Drosophila melanogaster a high value of R(u) (4.5) has been determined. For the nematode Caenorhabditis elegans the polymorphism data are incomplete but high values of R(u) are likely. Comparison of two strains of Escherichia coli O157:H7 shows a preponderance of indels. Because these six examples are from very distant systematic groups the implication is that in general, for alignments of closely related DNA, indels are responsible for many more unmatched nucleotides than are base substitutions. Human genetic evidence suggests that indels are a major source of gene defects, indicating that indels are a significant source of evolutionary change.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Chromosomes, Artificial, Bacterial / genetics
  • DNA / genetics*
  • DNA, Bacterial / genetics
  • Escherichia coli O157 / genetics
  • Evolution, Molecular*
  • Humans
  • Mutation
  • Pan troglodytes / genetics
  • Polymorphism, Genetic
  • Sea Urchins / genetics
  • Sequence Homology, Nucleic Acid

Substances

  • DNA, Bacterial
  • DNA