Global abundance of short tandem repeats is non-random in rodents and primates

BMC Genom Data. 2022 Nov 3;23(1):77. doi: 10.1186/s12863-022-01092-4.

Abstract

Background: While of predominant abundance across vertebrate genomes and significant biological implications, the relevance of short tandem repeats (STRs) (also known as microsatellites) to speciation remains largely elusive and attributed to random coincidence for the most part. Here we collected data on the whole-genome abundance of mono-, di-, and trinucleotide STRs in nine species, encompassing rodents and primates, including rat, mouse, olive baboon, gelada, macaque, gorilla, chimpanzee, bonobo, and human. The collected data were used to analyze hierarchical clustering of the STR abundances in the selected species.

Results: We found massive differential STR abundances between the rodent and primate orders. In addition, while numerous STRs had random abundance across the nine selected species, the global abundance conformed to three consistent < clusters>, as follows: <rat, mouse>, <gelada, macaque, olive baboon>, and <gorilla, chimpanzee, bonobo, human>, which coincided with the phylogenetic distances of the selected species (p < 4E-05). Exceptionally, in the trinucleotide STR compartment, human was significantly distant from all other species.

Conclusion: Based on hierarchical clustering, we propose that the global abundance of STRs is non-random in rodents and primates, and probably had a determining impact on the speciation of the two orders. We also propose the STRs and STR lengths, which predominantly conformed to the phylogeny of the selected species, exemplified by (t)10, (ct)6, and (taa4). Phylogenetic and experimental platforms are warranted to further examine the observed patterns and the biological mechanisms associated with those STRs.

Keywords: Abundance; Global; Hierarchical clustering; Non-random; Primate; Rodent; Short tandem repeat.

MeSH terms

  • Animals
  • Gorilla gorilla* / genetics
  • Humans
  • Macaca
  • Mice
  • Microsatellite Repeats / genetics
  • Pan paniscus
  • Pan troglodytes / genetics
  • Phylogeny
  • Primates / genetics
  • Rats
  • Rodentia* / genetics