Length variation of CAG/CAA triplet repeats in 50 genes among 16 inbred mouse strains

Gene. 2005 Apr 11:349:107-19. doi: 10.1016/j.gene.2004.11.050.

Abstract

CAG repeats coding for poly-glutamines have been studied by many groups as repeat length variations contributes to differences in protein function and disease outcome. In this study, we systematically searched public databases for genes carrying CAG repeats. For the genes obtained, we experimentally analyzed variations of length and the purity of the repeats in 62 loci among 16 inbred mouse strains, including wild-derived and laboratory strains. We found that length was conserved in 50% of the loci, especially among wild-derived strains. Of 496 polymorphic repeat alleles, 78% were uninterrupted and 22% were interrupted with non-CAG codons. Interruptions tended to occur in longer repeats and all repeats of greater length than 23 were interrupted. Although interruptions can act as suppressors for the expansion of CAG repeats, we found that the occurrence of the interruptions depended on the length of the CAG repeats. Furthermore, most poly-glutamines examined in this study existed in human orthologous genes, reflecting the functional significance of poly-glutamines in proteins.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles
  • Amino Acid Sequence
  • Amino Acid Substitution
  • Animals
  • Base Sequence
  • Cluster Analysis
  • Codon
  • Conserved Sequence
  • DNA / genetics
  • Databases, Factual
  • Evolution, Molecular
  • Genetic Variation*
  • Mice
  • Mice, Inbred Strains
  • Molecular Sequence Data
  • Phylogeny
  • Polymorphism, Single Nucleotide
  • Proteins / genetics
  • Sequence Homology, Amino Acid
  • Sequence Homology, Nucleic Acid
  • Trinucleotide Repeat Expansion
  • Trinucleotide Repeats*

Substances

  • Codon
  • Proteins
  • DNA