Distinct patterns of simple sequence repeats and GC distribution in intragenic and intergenic regions of primate genomes

Aging (Albany NY). 2016 Sep 16;8(11):2635-2654. doi: 10.18632/aging.101025.

Abstract

As the first systematic examination of simple sequence repeats (SSRs) and guanine-cytosine (GC) distribution in intragenic and intergenic regions of ten primates, our study showed that SSRs and GC displayed nonrandom distribution for both intragenic and intergenic regions, suggesting that they have potential roles in transcriptional or translational regulation. Our results suggest that the majority of SSRs are distributed in non-coding regions, such as the introns, TEs, and intergenic regions. In these primates, trinucleotide perfect (P) SSRs were the most abundant repeats type in the 5'UTRs and CDSs, whereas, mononucleotide P-SSRs were the most in the intron, 3'UTRs, TEs, and intergenic regions. The GC-contents varied greatly among different intragenic and intergenic regions: 5'UTRs > CDSs > 3'UTRs > TEs > introns > intergenic regions, and high GC-content was frequently distributed in exon-rich regions. Our results also showed that in the same intragenic and intergenic regions, the distribution of GC-contents were great similarity in the different primates. Tri- and hexanucleotide P-SSRs had the most GC-contents in the 5'UTRs and CDSs, whereas mononucleotide P-SSRs had the least GC-contents in the six genomic regions of these primates. The most frequent motifs for different length varied obviously with the different genomic regions.

Keywords: GC; Simple sequence repeats; genomic regions; patterns; primate genomes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Cytosine
  • DNA, Intergenic*
  • Genome*
  • Guanine
  • Microsatellite Repeats*
  • Primates / genetics*

Substances

  • DNA, Intergenic
  • Guanine
  • Cytosine