Quantifying the mechanisms for segmental duplications in mammalian genomes by statistical analysis and modeling

Proc Natl Acad Sci U S A. 2005 Mar 15;102(11):4051-6. doi: 10.1073/pnas.0407957102. Epub 2005 Mar 1.

Abstract

A large number of the segmental duplications in mammalian genomes have been cataloged by genome-wide sequence analyses. The molecular mechanisms involved in these duplications mostly remain a matter of speculation. To uncover, test, and further quantify the hypotheses on the mechanisms for the recent duplications in the mammalian genomes, we have performed a series of statistical analyses on the sequences flanking the duplicated segments and proposed a dynamic model for the duplication process. The model, when applied to the human duplication data, indicates that approximately 30% of the recent human segmental duplications were caused by a recombination-like mechanism, among which 12% were mediated by the most recently active repeat, Alu. But a significant proportion of the duplications are caused by some mechanism independent of the repeat distribution. A less sure but similar picture is found in the rodent genomes. A further analysis on the physical features of the flanking sequences suggests that one of the uncharacterized duplication mechanisms shared by the mammalian genomes is surprisingly well correlated with the physical instability in the DNA sequences.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Data Interpretation, Statistical
  • Evolution, Molecular*
  • Gene Duplication*
  • Genomics*
  • Humans
  • Mice
  • Models, Genetic
  • Rats
  • Recombination, Genetic
  • Sequence Analysis, DNA*