Serial segmental duplications during primate evolution result in complex human genome architecture

Genome Res. 2004 Nov;14(11):2209-20. doi: 10.1101/gr.2746604.

Abstract

The human genome is particularly rich in low-copy repeats (LCRs) or segmental duplications (5%-10%), and this characteristic likely distinguishes us from lower mammals such as rodents. How and why the complex human genome architecture consisting of multiple LCRs has evolved remains an open question. Using molecular and computational analyses of human and primate genomic regions, we analyzed the structure and evolution of LCRs that resulted in complex architectural features of the human genome in proximal 17p. We found that multiple LCRs of different origins are situated adjacent to one another, whereas each LCR changed at different time points between >25 to 3-7 million years ago (Mya) during primate evolution. Evolutionary studies in primates suggested communication between the LCRs by gene conversion. The DNA transposable element MER1-Charlie3 and retroviral ERVL elements were identified at the breakpoint of the t(4;19) chromosome translocation in Gorilla gorilla, suggesting a potential role for transpositions in evolution of the primate genome. Thus, a series of consecutive segmental duplication events during primate evolution resulted in complex genome architecture in proximal 17p. Some of the more recent events led to the formation of novel genes that in human are expressed primarily in the brain. Our observations support the contention that serial segmental duplication events might have orchestrated primate evolution by the generation of novel fusion/fission genes as well as potentially by genomic inversions associated with decreased recombination rates facilitating gene divergence.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Cell Line
  • Chromosomes, Human, Pair 17 / genetics*
  • Databases, Genetic
  • Evolution, Molecular*
  • Genome, Human*
  • Humans
  • In Situ Hybridization, Fluorescence
  • Molecular Sequence Data
  • Mutagenesis / genetics*
  • Primates / genetics*
  • Recombination, Genetic / genetics
  • Repetitive Sequences, Nucleic Acid / genetics*
  • Sequence Analysis, DNA

Associated data

  • GENBANK/BI828401